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Abstract 

In this paper, collocated and distributed space-time block codes (DSTBCs) which admit multi-group 
maximum likelihood (ML) decoding are studied. First the collocated case is considered and the problem of 
constructing space-time block codes (STBCs) which optimally tradeoff rate and ML decoding complexity 
is posed. Recently, sufficient conditions for multi-group ML decodability have been provided in the 
literature and codes meeting these sufficient conditions were called Clifford Unitary Weight (CUW) 
STBCs. An algebraic framework based on extended Clifford algebras is proposed to study CUW STBCs 
and using this framework, the optimal tradeoff between rate and ML decoding complexity of CUW STBCs 
is obtained for few specific cases. Code constructions meeting this tradeoff optimally are also provided. 
The paper then focuses on multi-group ML decodable DSTBCs for application in synchronous wireless 
relay networks and three constructions of four-group ML decodable DSTBCs are provided. Finally, the 
OFDM based Alamouti space-time coded scheme proposed by Li-Xia for a 2 relay asynchronous relay 
network is extended to a more general transmission scheme that can achieve full asynchronous cooperative 
diversity for arbitrary number of relays. It is then shown how differential encoding at the source can 
be combined with the proposed transmission scheme to arrive at a new transmission scheme that can 
achieve full cooperative diversity in asynchronous wireless relay networks with no channel information 
and also no timing error knowledge at the destination node. Four-group decodable DSTBCs applicable 
in the proposed OFDM based transmission scheme are also given. 
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I. Introduction 

Space-Time coding for Multiple Input Multiple Output (MIMO) systems has seen a lot of progress in 
the last decade. Starting from orthogonal designs [1], [2], [3] and quasi orthogonal designs [4], [5], [6], 
[7], [8], several space-time block code (STBC) constructions have been proposed in the literature including 
the recently proposed space-time block codes from division algebras [9], crossed product algebras [10], 
co-ordinate interleaved orthogonal designs [1 1] and Clifford algebras [12], [13], [14], [15]. Several aspects 
of space-time block codes (STBCs) have been studied in the literature. In the high SNR regime, two main 
aspects which dictate the error performance are diversity gain and coding gain. Of these two aspects, 
diversity gain has been well studied and presently many high rate, full diversity STBC constructions are 
available in the literature. An important class of such codes is the ones from division algebras [9]. Coding 
gain has remained an open problem not only for MIMO channels but also for Single Input Single Output 
channels and the AWGN channel. Later few more aspects such as the information lossless property [16] 
and the diversity-multiplexing gain tradeoff [17], [18] were introduced. Explicit STBCs satisfying these 
additional requirements were also obtained from division algebras [19], [20], [21]. However, there are still 
other important issues that need to be addressed. One such important issue is the Maximum Likelihood 
(ML) decoding complexity of STBCs. The lattice decoder or sphere decoder [22], [23] is known to be 
an efficient ML decoder. However, the complexity of a sphere decoder [24], [25] is also prohibitively 
large for high rate STBCs such as those from division algebras. For example, decoding a 4 x 4 STBC 
from cyclic division algebras is equivalent to decoding a 32 dimensional real lattice and performing a 
simulation to obtain an error performance curve can easily take several weeks. Thus it is not practically 
feasible to implement ML decoding for the 'good' performing codes in the literature. It is well known 
[6], [7], [11] that STBCs obtained from orthogonal designs (ODs) using QAM constellation admit single 
real symbol decoding and give full diversity. But for 4 Tx antennas, an OD which provides a transmission 
rate of 1 complex symbol per channel use does not exist [1], [2], [3]. However, it was shown in [6], [7], 
[11], [12] that a single complex symbol decodable (2 real symbol decodable) full diversity STBC for 
4 transmit antennas can be constructed. Later in [8], [13], [14], [15], the general framework of multi- 
symbol decodable or multi-group decodable STBCs was introduced to improve the transmission rate. 
Multi-symbol or multi-group decodable STBCs admit ML decoding to be done separately for groups of 
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symbols rather than all the symbols together thus reducing the ML decoding complexity. The class of 
STBCs from ODs correspond to the case of one real symbol per group. Thus it is clear that there is 
a tradeoff involving rate, ML decoding complexity and number of transmit antennas for full diversity 
STBCs. In the first part of this paper, measures of rate and ML decoding complexity are given and 
the problem of optimally trading off rate for ML decoding complexity within the framework of multi- 
group decodable STBCs is formally posed. A partial solution to this general problem is provided by 
characterizing this tradeoff for a certain specific class of STBCs called as Clifford Unitary Weight (CUW) 
STBCs [12], [13], [14], [15]. An algebraic framework based on extended Clifford algebras is introduced to 
study CUW STBCs. This framework is used to obtain the optimal rate-ML decoding complexity tradeoff 
and also to construct CUW STBCs meeting this tradeoff optimally. Recently in [29], [30], [31], a 2 x 2 
high rate, information lossless STBC with low ML decoding complexity and non-vanishing determinants 
has been discussed. This 2x2 STBC is not a multi-group decodable STBC and such STBCs are not 
considered in this paper. 

The second part of the paper focuses on constructing distributed space-time block codes (DSTBCs) with 
low ML decoding complexity for the Jing-Hassibi protocol [33]. Distributed space-time coding [32], [33] 
is a coding technique for exploiting cooperative diversity in wireless relay networks wherein each relay is 
made to transmit a column of a space-time code thereby imitating a multiple antenna system. There are 
mainly two types of processing at the relay nodes that are widely discussed in the literature: (1) amplify 
and forward and (2) decode and forward. Throughout this paper, we focus only on amplify and forward 
based protocols for three reasons: (1) relay nodes are not required to decode and re-encode, (2) relay 
nodes do not require the channel knowledge for processing (this feature can permit a possible extension 
of the protocol to a completely non-coherent strategy) and (3) simpler processing at the relay nodes. 
In [33], Jing and Hassibi have proposed an amplify and forward based two phase transmission protocol 
for achieving cooperative diversity in wireless relay networks. This protocol essentially employs STBCs 
satisfying certain additional conditions to take care of the distributed nature. We call such codes satisfying 
certain additional conditions as DSTBCs to distinguish them from collocated STBCs. Analogous to the 
case of collocated STBCs, for large number of relays, the ML decoding complexity of DSTBCs becomes 
too prohibitive at the destination and thus is an important issue that needs to be addressed. Most of the 
previous works on DSTBCs [34], [35], [36] fail to address this issue. In [37], full diversity, two-group 
ML decodable DSTBCs were constructed using division algebras. In [38], quasi-orthogonal STBCs were 
proposed for use as DSTBCs for the specific case of 4 relays. In the second part of this paper, using 
the algebraic framework of extended Clifford algebras introduced in the first part, three new classes of 
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four-group decodable full diversity DSTBCs for any number of relays are constructed. 

The Jing-Hassibi protocol assumes that there is perfect symbol synchronization amongst the relay nodes 
and that the signals transmitted from the relays arrive at the same time at the destination. But achieving 
symbol synchronization among the geographically distributed relay nodes is a challenging and difficult 
task in practice. Several works in the literature [39], [40], [41], [42], [43], [44], [45], [46] have recognized 
this as a major bottleneck and have proposed many coding and transmission techniques to mitigate the 
effects of symbol asynchronism. Most of the works based on amplify and forward propose methods to 
achieve full cooperative diversity in asynchronous wireless relay networks, but however fail to address 
the ML decoding complexity issues. In [39], a OFDM based Alamouti transmission scheme is proposed 
to combat the effects of symbol asynchronism. The Li-Xia transmission scheme is particularly interesting 
because of its associated simplicity and low decoding complexity. In this scheme, OFDM is implemented 
at the source node and time reversal/conjugation is performed at the relay nodes on the received OFDM 
symbols from the source node. The received signals at the destination after OFDM demodulation are 
shown to have the Alamouti STBC structure and hence single symbol maximum likelihood (ML) decoding 
can be performed. However, the Alamouti code is applicable only for the case of two relay nodes and 
for larger number of relays, the authors of [39] propose to cluster the relay nodes and employ Alamouti 
code in each cluster. But this clustering technique provides diversity order of only two and fails to exploit 
the diversity available in the network. Motivated by the results of [39], in the third part of this paper 
it is shown that the DSTBCs proposed in this paper can be used along with OFDM to achieve full 
asynchronous cooperative diversity for any number of relays along with low ML decoding complexity. 

Finally it is shown how differential encoding at the source node can be combined with the proposed 
OFDM based transmission scheme to arrive at a new transmission scheme that provides full cooperative 
diversity in asynchronous relay networks with no channel information and also no timing error knowledge 
at any of the nodes. 

The main contributions of this paper can be summarized as follows: 

• A new measure of rate of an STBC is defined and the problem of optimal tradeoff between rate and 
ML decoding complexity within the framework of multi-group ML decodable STBCs is posed. An 
algebraic framework based on extended Clifford algebras is introduced for studying CUW STBCs. 
Using this algebraic framework and tools from representation theory of groups, the optimal tradeoff 
between rate and ML decoding complexity of CUW STBCs is characterized for certain specific 
cases. 

• Constructions of CUW STBCs meeting this optimal tradeoff for the specific cases are also provided. 
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The ABBA construction first proposed in [5] is shown to be a certain specific matrix representation 
of extended Clifford algebras and hence they fall under the class of CUW STBCs. The contributions 
on multi-group ML decodable collocated STBCs are described in Section UU. 

• The Jing-Hassibi protocol [33] is generalized to allow non-unitary matrices at the relays. The neces- 
sary and sufficient conditions needed for DSTBCs to be multi-group ML decodable are identified and 
three new classes of four group ML decodable DSTBCs which achieve full cooperative diversity 
for any number of relays are also provided. To the knowledge of the authors, these are the first 
known DSTBCs that achieve the least possible ML decoding complexity compared to all other 
DSTBC constructions, having the same transmission rate in complex symbols per channel use in 
the literature. This contribution is detailed in Section UlTl 

• The OFDM based Alamouti transmission scheme for 2 relays in [39] is extended to a more general 
transmission scheme that can achieve full asynchronous cooperative diversity for any number of 
relays. Sufficient conditions for a DSTBC to be compatible with the requirements of this OFDM 
based transmission scheme are provided and the four-group decodable DSTBCs in this paper are 
shown to satisfy these sufficient conditions. 

• It is shown how differential encoding at the source node can be combined with the proposed OFDM 
based transmission scheme to arrive at a new transmission scheme that provides full cooperative 
diversity in asynchronous relay networks with no channel information and also no timing error 
knowledge at any of the nodes. All the results based on OFDM for asynchronous relay networks 
comprise Section ITVl 

A. Notation 

Vectors and matrices are denoted by lowercase bold letters and uppercase bold letters respectively. I m , 
m denote anmxm identity matrix and m x m all zero matrix respectively. I and are used to denote an 
identity matrix and an all zero matrix respectively having an appropriate size depending on the context. 
For a set A, the cardinality of A is denoted by |^4|. A null set is denoted by (p. For a matrix, (.) T , (.)* and 
(.) denote transposition, conjugation and conjugate transpose operations respectively. For a complex 
matrix X, the matrices Xi and Xq denote the matrices obtained by taking the real and imaginary parts 
of X respectively. If B is a module over a base ring R, then EucIrB denotes the set of all R linear 
maps from B to B. For sets A\ and Ai, the Cartesian product of A\ and Ai is denoted by A\ x A<i- 
For groups G\ and G2, the direct product of G\ and G2 is denoted by Gi x G2. For vector spaces V\ 
and V2, the tensor product of V% and V2 is denoted by V\ (8> V%. For a vector space V, GL(V) is used 
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to denote the set of invertible linear maps from V to V. 



II. Multi-group ML Decodable Collocated STBCs 

In this section, multi-group ML decodable collocated STBCs are discussed. In Subsection III-AI a 
relation between STBCs and linear space-time designs is given and using this relation, measures of 
rate and ML decoding complexity of STBCs are defined. In Subsection III-BI linear space-time designs 
are classified based upon the classification done in [12] for single complex symbol decodable codes. 
An algebraic framework based on extended Clifford algebras is introduced to study a class of linear 
space-time designs called Clifford unitary weight designs. Using this algebraic framework, the optimal 
tradeoff between rate and ML decoding complexity of STBCs from Clifford unitary weight designs is 
characterized under some conditions in Subsection III-CI 



A. STBCs and Linear Space-Time Designs 

In this subsection, a connection between STBCs and linear space-time designs is established. Using 
this relation, measures of rate and ML decoding complexity of a STBC are then defined. 

Definition 1: A STBC "rf of size T x Nt is a finite set of T x Nt complex matrices. 

Let Nt denote the number of transmit antennas, Nr denote the number of receive antennas and T 
denote the number of channel uses consumed for transmitting a space-time codeword. Then the rate of 
transmission in bits per channel use (bpcu) of a STBC as in Definition Q] is given by log ^J ^ bpcu. In this 
paper, we use a different measure of rate which is motivated by basic concepts of dimension in linear 
algebra. This measure is also indicative of the coding gain of the STBC and several examples of STBCs 
in the literature are discussed to illustrate the significance of the new measure of rate introduced in this 
paper. 

Note that the set of all T x Nt complex matrices is a vector space over the field of real numbers R 
and has a dimension of 2TNt over R. Consider the subspace (ff) spanned by the codewords, i.e., the 
elements of ^ . Let K denote the dimension of ( < ?f) over R and let Ai,i = 1, ... ,K £ (£TxN T ^ e a 
basis for Then every element of ^ can be expressed as J2^=i x i&-\ f° r some Xi, i = 1, . . . , K 6 R. 



If we think of the Xj S ctS real variables and S(s 



Xi X 2 ... XK 



T 

) = Y2i=i x iAi as a matrix 



whose entries are complex linear functions of the real variables, then the STBC & can be expressed as 

<g = {5(s)|s G ^} (1) 

for some finite subset C R^. 
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Definition 2: A linear space-time design (LSTD) S( x\ X2 • • • xk ) °f s i ze T x -^T m rea l 
variables xi, X2, • • • , a^x is a T x iVy matrix which can be expressed as 2~2i=i x i-^i f° r some 
Ai, i = 1, 2, . . . , K € C TxN t which are linearly independent over the field of real numbers. 

The notion of linear independence of weight matrices of a LSTD over R has not been stressed or 
mentioned explicitly in most previous works though it has been implicitly assumed. 

Notice that (OQ) specifies a way to describe STBCs using linear space-time designs (LSTDs) and also 
explicitly provides a method to encode STBCs. From an encoding perspective, the real variables can be 
thought of as modulating the matrices Ai, i = 1,...,K. Hence we call the matrices Ai, i = 1,...,K 
as basis matrices or modulation matrices or weight matrices. The vector of real variables s takes values 



from 



C 



We call £? as the signal set. The connection between STBCs and LSTDs is pictorially 



depicted in Fig. Q] 




Fig. 1. STBCs and linear space-time designs 



Remark 1: Note that for a given STBC the set of basis matrices Ai, i = 1, . . . , K along with the 
associated signal set s$ is not unique, i.e., there may exist another set of basis matrices with some other 
associated signal set that results in the same STBC ^ . Note also that it is not necessary that the basis 
matrices have to be codewords. We shall see in the sequel that the choice of basis matrices and signal set 
controls the encoding as well as decoding complexity. However, it is important to note that K is unique 
to the STBC <i. 
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Thus a STBC can be thought of as a subset of a subspace of dimension K. Thus designing a STBC 
can be done in two steps: First choose a subspace of dimension K (choose a LSTD) and then choose a 
subset of required cardinality (choose the signal set &4~) within the chosen subspace. 

I ) Measure of Rate: 
In this paper, we use the following definition of rate of a STBC. 

Definition 3: Rate of a STBC = dimenh ^ n ((^)) = K di mens i ons per channel use. 
Note that the unit of rate of a STBC according to Definition [3] is dimensions per channel use (dpcu). 
Since there are K real variables which modulate K modulation matrices, we can view it as though we 
are sending K real symbols (one on each dimension) in T channel uses. Alternatively, we can pair two 
real variables at a time and view it as 4*- complex symbols being transmitted in T channel uses. We 
would like to mention that most of the previous works on STBCs follow the convention of measuring 
rate in complex symbols per channel use which in our case is ^ complex symbols per channel use and 
is simply proportional to rate as per Definition [3] Though the terminology of basis matrices and rate 
have been used previously in the literature (for example see [26]), to the knowledge of the authors, rate 
of a STBC has not been defined explicitly and clearly as in Definition [3] although many works in the 
literature may be measuring rate in a similar way. Note that if linear independence of basis matrices is 
not retained and if rate were to be measured by simply counting the number of complex variables in the 
LSTD, then one can claim to have any arbitrary rate of transmission which can be quite deceptive at 
times. The notion of linear independence makes things clear and avoids such confusions. Definition [3] is 
particularly useful because it essentially allows to define rate of a LSTD, hence allowing us to separate 
the study of LSTDs from STBCs. Also, we argue that rate as per Definition [3] is a first order indicative of 
coding gain and hence is a parameter which has to be maximized. Intuitively, the higher the dimension, 
the more efficiently we can pack codewords in it optimizing some criteria. One of the criteria of interest 
is to maximize the coding gain which is given by minc 1 ,c 2 e^' det ((Ci — C>2) H {C>i — ^2])- 

Recall that even in the case of classical linear error correcting codes over finite fields, rate was defined 
as the ratio of the dimension of the subspace spanned by the codewords to the number of channel uses. 
In the case of classical linear error correcting codes, the code itself is a subspace whereas in the case 
of STBCs, the code is a subset of a subspace. The following examples of existing STBCs reinforce the 
statement that rate as per Definition [3] is a first order indicative of coding gain. 

Example 2.1: Let us consider the Alamouti code [27] and the Golden code [28] which are given by: 

X\ + 1X2 —X3 + IX4 



X3 + IX4 Xl — 1X2 



and 
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(x\ + 1x2)0 + (X3 + 1x4)06 (X5 + ix§)a + {x-j + ixg)a6 
i((x5 + ixo)a + (x-j + ix$)a6) (x\ + 1x2)0. + (23 + 1x4)06 



respectively where, 6 



2 ' 



*~2 ' a = 1 + *(1 — Q) an d a = l + i(l — (9). In both cases, the real variables are allowed to take 
values independently from a finite subset of Z. It can be checked that there are 4 basis matrices for the 
Alamouti code and 8 basis matrices for the Golden code. Thus the rate of Alamouti code and Golden 
code are 2 dpcu and 4 dpcu respectively and it is well known [28] that the Golden code outperforms the 
Alamouti code when they are both compared with the same transmission rate in bpcu. 



Example 2.2: Let us consider the 4x4 OD and the 4x4 quasi orthogonal design. They are given by: 

x\ + 1x2 — X3 + 1x4 — X5 + ixq 



X3 + 1x4 x\ — 1x2 







-x 5 + ix 6 



x 5 + ix 6 





%i — 1X2 


X3 — 1x4 





x 5 + ix 6 


— X3 — 1x4 


X\ + 1X2 


x\ + ix 2 


-x 3 + 1x4, 


x 5 + ix 6 


-x-j + ix 8 


x 3 + IX4 


X\ — 1X2 


x-j + ix s 


x 5 - ix 6 


x 5 + ix 6 


-x-j + ix s 


X\ + 1x2 


—X3 + 1x4 


x 7 + ix s 


x 5 - ix Q 


X 3 + IX4 


X\ — iX2 



and 



respectively. Their respective rates can be verified to 



be I dpcu and 2 dpcu respectively. STBCs from quasi orthogonal designs are known to outperform 
STBCs from ODs [4], [5] for the same transmission rate in bpcu. 



The above examples show that given two STBCs having the same number of codewords, the one having 
higher rate as per Definition [3] outperforms the other in most cases, thus providing a good motivation for 
Definition [3] 



2) Measure of ML decoding complexity: 
Towards defining a measure for ML decoding complexity, let us first define a measure of encoding 
complexity. If we use (Q]) for encoding a STBC using LSTDs, we see that in general one needs to choose 
an element from srf and then substitute for the real variables x%, X2, ■ ■ ■ , xk in the LSTD. This method 
of encoding clearly requires a lookup table (memory) with entries. However, if the signal set si 
is a Cartesian product of g smaller signal sets in dimension then the complexity can be reduced. 
To be precise, if sf = s/\ X s/2 X • • • X s/ g where each s/i C M » with cardinality \s/\s, then the 
STBC ^ itself decomposes as a sum of g different STBCs, which is shown below. Let K = g\. Then 
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■0 



by appropriately reordering/relabeling the real variables we can assume without loss of generality L| that 

S(s) = E£i XiAi = Si(si) + S 2 (s 2 ) + • • • + S g (s g ) where, Si(s;) = £}i (i _ 1)A+1 x i A i and 

1 T 

. Xi\ ■ Hence the STBC decomposes as ^ = J2i=i ^ where, 



x (i-l)\+l x {i-l)\+2 



Vl = {Si(si)|si G M} 
^2 = {S 2 (s 2 )|s 2 G ^ 2 } 



^ 9 = {Sg(sg)|s g €^} 
Definition 4: [54] A STBC # = {5(s)|s G ^ C R K } is said to g-group encodable or y real symbol 

K — 

encodable (or ^ complex symbol decodable) if m = sx\ x x • • • X &/ g where each rfjCM» with 
i 

cardinality \s>/\ » . 

The encoding of a g-group encodable STBC is pictorially shown in Fig. [2] Thus the encoding com- 
plexity of g-group encodable STBCs is g(\ , Ta\ s ). Note that in addition if stf\ = = • • • = srf g then the 
memory required for encoding is also minimized. 



Bits 



Bits-> s 2 



Bits-> s e 



Sx(s x ) 






S 2 (s 2 ) 




• 
• 
• 
• 


S g (s g ) 


— 2~2i=(g-l)\+l X iAi 



S(s)=ELiSi(si) 



Fig. 2. Encoding for a g-group encodable STBC 



Example 2.3: Consider the example of the Golden code which was discussed in Example 12.11 As per 
Definition HI the Golden code is 8-group encodable or single real symbol encodable. 

Thus we have seen how a g-group encodable STBC decomposes into a sum of g STBCs 
"jfi, i = l,...,g and thus admits independent encoding of the %,'s. A natural question that follows is: 
Under what conditions does a g-group encodable STBC admit independent decoding of the constituent 
^'s? Towards that end, let us look at the ML decoding metric. Let X be the transmitted codeword of 

'Here we have assumed that the first A real variables belong to first group and the second A real variables belong to the 
second group and the last A real variables belong to the g-th group. In general, the partitioning of real variables into g-groups 
can be quite arbitrary. 
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size T x Nt, H be the Nt X Nr channel matrix and Y be the received matrix of size T x Nr. Then, 
the ML decoder is given by 



X 



are min II Y — XH || F 



(2) 



For a g-group encodable STBC V, X = £)f =1 X ; for some X ; G ^. It can be shown [11], [12], [13] 
that if the basis matrices Ai, i = 1, . . . , K satisfy the condition 



Ai^Aj + Aj^Ai = whenever A; G (tf k ), Aj G k^l 
then the ML decoder decomposes as 



(3) 



X 



i=i 



are; min || Y 



X ; H 



(4) 



In other words, the component STBCs ^'s can then be decoded independently. It can also be shown 
[11], [12], [13] that ([3]) is a necessary condition for this to happen. 

Remark 2: Note that the subspaces = 1,. . . ,K intersect trivially, i.e., (^) n (%) = 0. Thus 

Crf) = ^€\) © (^2) © • • • © ffig}- If the condition in (f3]) is satisfied for the basis matrices, then it implies 
that A H B + B^A = 0,V A G {%),B G (%), k ^ I. In other words, this becomes a property of the 
two subspaces (ffk) and (%}■ 

Definition 5: [54] A STBC ^ = {S(s)|s G £? C M^} is said to g-group decodable or y real symbol 
decodable (or ^ complex symbol decodable) if ^ is g-group encodable and if the associated basis 
matrices satisfy ([3]). 

Example 2.4: All STBCs obtained from ODs are single real symbol decodable if every real variable 
in the OD takes values independently from a PAM (Pulse Amplitude Modulation) signal set. As an 
example, consider the Alamouti code that was previously discussed in Example 12. lj The associated basis 





1 







* 




-1 




i 




matrices are Ai = 





1 


, A 2 = 


-i 


, A 3 = 


1 


and A4 = 


i 


. It can 



checked that they satisfy the condition in ([3]) for g = 4. In this case, Si(si) = X1A1, 82(82) = X2A2, 
83(83) = X3A3 and 84(84) = X4A4. Hence the Alamouti code is single real symbol decodable. 

The ML decoding for a g-group decodable code is illustrated pictorially in Fig. [3] It is clear that 

1 

the decoding complexity is reduced for g-group decodable STBCs from |^| computations to gffil 9 
computations. Further, we know that the sphere decoder [22], [23] is an efficient ML decoder if vector 
s takes values from a lattice constellation. Moreover, it has been shown [24], [25] that the average 
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= argmin Xl e%?i 


i v 
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A 1 1 1 


12 

If 














— - 


x 2 


= argmin X2 e% 


Y 


-X 2 H 


|2 

If 






• 
• 
• 








— - 




= argmin Xg e%? 9 


1 Y 


-X g H 


1 1 2 
\\F 




Fig. 3. ML decoding for a g-group ML decodable STBC 



complexity of a sphere decoder depends on the dimension of the equivalent lattice [23] and more or less 
independent of the size of the code. Thus, we can take the dimension of the corresponding equivalent 
lattice as a measure of the sphere decoder complexity. For a general STBC, this dimension is equal to 
K whereas for g-group ML decodable STBCs, it is y = A. Thus the expected as well as the worst case 
ML decoding complexity is lesser for (/-group ML decodable STBCs. 

3) Full diversity: 

Apart from rate and ML decoding complexity, yet another important aspect of STBCs is the diversity 
gain. Diversity gain is a measure of the slope of the error probability versus the SNR when plotted on 
a log-log scale and this is given by A^minc^Cae'*? rank(Ci — C2)). Thus full diversity of NrNt is 
achieved by a STBC if the coding gain is not equal to zero. 

4 ) Problem statement of optimal Rate-ML decoding complexity tradeoff: 

Having surveyed three important aspects of rate, ML decoding complexity and diversity for a STBC, 
we can now pose the problem of rate-ML decoding complexity tradeoff. This problem can be formally 
stated in two equivalent ways which are listed down as given below. 

1) Given A, T and N t what is the maximum rate of any full diversity STBC? 

2) Given g, T and N t what is the maximum rate of any full diversity STBC? 

If A = 1 and Nt = T, then the solution is precisely the STBCs from square orthogonal designs 
constructed in [2], [3] for which the maximum rate is ^"f^^li dpcu. In this paper, the maximum rate 
of a certain class of full diversity square STBCs from Clifford unitary weight designs is characterized 
for A = T. 

The following example illustrates that full diversity and encoding/decoding complexity are related 
indirectly. 

Example 2.5: Consider the 4 x 4 co-ordinate interleaved orthogonal design (CIOD) [11] given by 
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x\ + ix2 —23 + 1x4 
23 + 1x4 x\ — 1x2 

. The weight matrices of the above LSTD satisfy 

X5 + zxq —xj + ixg 

x-j + ixs 25 — zxq 

A{ H Aj + A^Ai = 0, V % 7^ j. Let the notation A stand for the codeword difference matrix. Note that 

det (AS H AS) = fet=i A ^ 2 ) (EL5 Aa;2 ) > which wiU e q ual to zero for 

some pair of codeword 

matrices if all the 8 real variables are allowed to take values independently. Hence, it is not possible to 
obtain a full diversity single real symbol decodable STBC from the above LSTD. However, by entangling 
two real variables, as for example {21,25}, {22,23}, {23,2:7}, {24, x$} and then allowing them to take 
values from a rotated QAM constellation (rotating a QAM constellation entangles the variables), a full 
diversity, single complex symbol ML decodable STBC can be obtained [11]. The resulting STBC will 
be 4-group ML decodable or 2-real symbol ML decodable and its associated four constituent STBCs are 
given by Si(si) = x 1 A 1 + 25A5, S 2 (s 2 ) = x 2 A 2 + 2 6 A 6 , S 3 (s 3 ) = x 3 A 3 + x 7 A 7 and S 4 (s 4 ) = 
24A4 + x 8 A 8 . 

Example 12.51 shows that the requirement of full diversity can sometimes demand an increase in the 
encoding complexity and hence the decoding complexity even if the associated weight matrices satisfy 
condition ([3]) for A = 1. Thus, it is clear that full diversity and encoding/decoding complexity are inter- 
related and there exists a tradeoff between the two. 



B. Clifford Unitary Weight Designs and extended Clifford algebras 

First, let us classify square LSTDs (as done in [12] for single complex symbol decodable codes). 
LSTDs can be broadly classified as unitary weight designs (UWDs) and non unitary weight designs 
(NUWDs). A UWD is one for which all the weight matrices are unitary and NUWDs are defined as 
those which are not UWDs. Clifford unitary weight designs (CUWDs) are a proper subclass of UWDs 
whose weight matrices satisfy certain sufficient conditions for g-group ML decodability. To state those 
sufficient conditions, let us list down the weight matrices of a CUWD in the form of an array as shown 
in Table H 

For simplicity, the grouping is assumed to be as follows: All the weight matrices in one column belong 
to one group. The weight matrices of CUWDs satisfy the following sufficient conditions for (/-group ML 
decodability. 

1) A 1 = I. 

2) The unitary matrices in the first row except Ai should form a Hurwitz-Radon family [1], [2], [3]. 
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TABLE I 

Structure of CUWDs 



Ai 


A A+ i . 


A( g _i)A+l 


A 2 


Aa+2 


A( g _ 1)A + 2 


A A 


A 2A . 


A K 



In other words, all the matrices in the first row except Ai should square to —I and should pair-wise 
anti-commute among themselves. 

3) The unitary matrices in the first column should square to I and should commute with all the 
matrices in the first row and first column. 

4) The unitary matrix in the i-th row and the j-th column is equal to ±AiAy_ 1 )\ i+1 . 

It can be checked that the above four conditions together imply that the necessary and sufficient 
condition for g-group ML decodability in © is satisfied and hence any CUWD is g-group ML decodable. 
Note that when A = 1, CUWDs become ODs [1], [2], [3]. Similarly the co-ordinate interleaved orthogonal 
designs proposed in [11] are a proper subclass of NUWDs. The single complex symbol ML decodable 
STBCs in [6] are also CUWDs [12]. Fig. |4] pictorially shows the broad classification of LSTDs. 

1) Full diversity lattice constellations for Clifford Unitary Weight designs: 
An important advantage of CUWDs is that full diversity STBCs can be obtained from them without 
increasing the encoding/decoding complexity contrary to the case of CIODs (see Example 12.51 ) wherein 
real variables from different groups have to be entangled for full diversity. Moreover, explicit lattice 
constellations that optimize the coding gain can be obtained for CUWDs, thus admitting the use of a 
lattice/sphere decoder. In [12], [13], [14], [15], few constructions of CUWDs are available and the aspect 
of full diversity has been addressed in detail. In this paper, we only provide a brief outline of the basic 
idea (described below) and illustrate the procedure with an example. Later in the proof of Theorem |6l 
we also provide a new construction of CUWDs. 

For a CUWD, det (AS(s) H AS(s)) = det (£f AS i (s i ) H AS i (s i )). If det (AS i (s i ) H AS i (s i )) > 
0, Mi = 1, . . . , g then, we have 

9 

det (AS(s) H AS(s)) > J^det (ASi(si)^ASi(si)) . 

i=l 
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Fig. 4. Classification of LSTDs 



Thus it is sufficient to construct full diversity lattice constellations independently for each of the con- 
stituent LSTDs, i.e., Si(si)'s and this will ensure g-group ML decodability. Note that 

Si(si) = A (i _ 1)A+1 (x (i _ 1)A+1 I + x {i _ l)x+2 A 2 + h x a A A ) = A (i _ 1)A+1 Si(si) which implies 

det (ASi(si)^ ASi(si)) = det (ASi(si) J/ ASi(si)). Hence without loss of generality, we can consider 
the construction of full diversity lattice constellations for the LSTD Si(si) since the same lattice 
constellation will ensure full diversity for the remaining constituent LSTDs Si(si), i = 2,...,g. We 
have Si(si) = 2~2i=i x i&-\- Note that the matrices Ai, i = 1, . . . , A are unitary, square to I and pairwise 
commute among themselves. Hence they are simultaneously diagonalizable by some unitary matrix U 
to result in diagonal unitary matrices Di,D2, . . . ,D A . All these diagonal matrices will continue to be 
linearly independent over E and since all of them square to I, the diagonal entries of Di, i = 1, . . . , A 
are ±1 and Di = I. Thus the LSTD USi(si)U H = J2i=i x i^\ becomes a diagonal matrix for which it 
is easy to compute the determinant and also find the lattice constellation that will provide full diversity. 
This procedure is illustrated in the following example. 

Example 2.6: Consider a CUWD for Nt = 8, A = 4, g = 4 and K = Xg = 16 given by 
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s (s) = E'=i A i where > Ai = I 8 , A 2 



A 4 = A 2 A 3 , A 5 







o 2 i 2 o 2 2 

I 2 2 2 2 
2 2 2 I 2 
2 2 I 2 2 


i 
i 
i 
i 
i 
% 
i 



A 



9 



, A 3 

1 

-1 



0-10 











2 2 I 2 2 

2 2 2 I 2 

I 2 2 2 2 

2 I 2 2 2 





1 



1 

0-10 00 

1 

-1 



^13 



% 


























—i 


























% 


























—i 


























i 


























— i 


























i 


























— i 



and A 4 ; + j 



A^+iAj, 



1,2,3, j = l, 



A. 



It can be checked that the above listed basis matrices satisfy all the requirements of a CUWD for 
A = 4, g = 4. For the purpose of finding full diversity lattice constellations, it is enough to construct 
full diversity signal sets for the LSTD Si(si) = Ylt=i Ai- Since the matrices Ai, i = 1, . . . , 4 mutually 
commute among themselves and square to Is, they can be simultaneously diagonalized by a unitary 



matrix U = 



h- > 


h- 1 


1 1 


1 


1 


-1 


1 -1 


1 -1 


1 


-1 


1 


-1 -1 


1 1 


-1 


-1 


-1 


-1 1 


1 -1 


-1 


1 


1 


1 1 


-1 -1 


-1 


-1 


-1 


1 -1 


-1 1 


-1 


1 


1 


-1 -1 


-1 -1 


1 


1 


-1 


-1 1 


-1 1 


1 


-1 



which in this case turns out to be the 8 x 8 
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Hadamard matrix. Defining Di = UAiU^, i = 1, . . . , 4, we get Di = Is, 



diag 



1 1 



and D 4 = diag 



1 1 



-111- 
-1 -1 -1 

det(ASi 



-1 -1 
-1 1 1 



|,D 3 = diag | 



1111 



1 -1 



where, 



A<?i Aq 2 Aq 3 Aq 4 



j. Thus we have 

(A qi ) 2 (Aq 2 )\A q3 )\Aq,f 

-iT 

Axi Ax 2 Ax 3 Ax 4 andP 



1 


1 


1 


1 


1 


-1 


1 


-1 


1 


1 


-1 


-1 


1 


-1 


-1 


1 



Thus full diversity will be achieved if Aqi ^ 0, V i = 1,...,4. This can be guaranteed by letting 

T 

Si 



X\ X 2 X 3 2 4 



take values from P _1 C/Z 4 where, Q is the generator matrix of a lattice 
designed to maximize the product distance [47], [59]. 

2 ) Extended Clifford Algebras: 
Towards constructing and studying CUWDs an algebraic framework of extended Clifford algebras is 
first established. Using this algebraic framework, the optimal tradeoff between rate and ML decoding 
complexity of CUWDs is obtained in Subsection III-CI Furthermore, algebraic descriptions for the ABBA 
construction [5] and the tensor product based construction in [14] are provided using extended Clifford 
algebras. 

First observe that in order to construct CUWDs it is sufficient to construct the weight matrices in the 
first row and first column (as discussed in Subsection III-BI and Table Our methodology to construct 
the weight matrices in the first row and first column would be to fabricate an algebra in such a way 
that it contains elements satisfying the algebraic relations we need. Once we construct such an algebra, 
we then obtain the required CUWD by taking an appropriate matrix representation of the constructed 
algebra. Recall that an algebra over a field is simply a ring as well as a vector space with the addition 
operation being compatible to both the ring and the vector space structures. Let us recall certain basic 
definitions from algebra which will be useful in the sequel. 

Definition 6: A nonempty set B equipped with two binary operations called addition and multiplication 
denoted by + and . is called a ring denoted by (B, +, .) if 

1) (B, +) is a Abelian group 

2) (B, .) is a monoid with multiplicative identity 1 

3) x.(y + z) = x.y + x.z, V x,y, z G B 

4) (x + y).z = x.z + y.z, V x,y, z £ B 



November 22, 2008 



DRAFT 



18 



Definition 7: A nonempty set A equipped with two binary operations called addition and multiplication 
denoted by + and . is called a right module algebra over a ring B if 

1) (A, +, .) is a ring 

2) There is a map (x, a) — > xa of A x B into A satisfying the following for all a, (3 € B and x, y € A. 

(x + y)a = xa + ya 
x(a + 8) = xa + x(3 

(5) 

x{af3) = {xa)B 
xl = x 

Note that in the standard mathematical literature (for example [64]), algebra is usually defined over 
a field. Since our definition differs from the definition in [64], we have given the name 'right module 
algebra' in order to distinguish it from the concept of algebra over a field. 

Definition 8: [2] The Clifford algebra, denoted by Clif f n is the algebra over R generated by n 
objects 7^, k = l,...,n which are anti-commuting (jklj = —Ijlk, VA; / j) and squaring to — 1 
( 7 2 = -l Vfc = l,...,n). 

A natural basis for Clif f n seen as a vector space over R is 

^n = {l}\jH\i = l,...,n} |J I [jTfcJl < h< h+i < n\ . (6) 

m=2 U=l J 

The number of basis elements is \^ n \ = 2 n . 

Example 2.7: Clif fo is nothing but the set of real numbers R, Clif fi is the set of complex numbers 
C and Clif f'2 is the Hamiltonian Quaternions denoted by EL 

The reason we are interested in Clifford algebras is that the defining algebraic relations of the generators 
of a Clifford algebra resemble the algebraic relations which the matrices in the first row of a CUWD need 
to satisfy. Hence we can obtain the matrices in the first row by taking unitary matrix representations of 
the generators of a Clifford algebra. To obtain the matrices in the first column, we use a similar strategy. 
We introduce few new symbols in the Clifford algebra and define them to square to 1, commute with the 
generators of the Clifford algebra and also commute among themselves. In other words, after introducing 
new symbols, multiplication in the algebra is appropriately defined in order to create a bigger algebra 
which contains Clifford algebra as a sub-algebra. Hence by taking a unitary matrix representation of 
these specific elements of the algebra, we get the weight matrices of the required CUWD. We give the 
name 'extended Clifford algebras' to the so constructed algebras: 

Definition 9: Let L = 2 a , a G N. An extended Clifford algebra denoted by is the associative 
algebra over R generated by n + a objects 7^, k = 1, . . . , n and 8{, i = 1, . . . , a which satisfy the 
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following relations: 

• 7fe = — 1, Vfc = l,...,n 

• 7fc7i = -7j7A> V k^j 



. ^ = 1, VA: = l,...,a 

• <Wj = <5j'^fcj V 1 < < a 

• <^7j = 7j^fc, V 1 < K a, 1 < j < n 

From the above definition, it is clear that Cliff n (or A*) is a sub-algebra of A^. Let 
basis for this sub-algebra Cliff n . Then a natural R basis for A^; is given by 



be the natural 



(V) 



3Bl = 3B n U {m n h\i = l,...,a} (J ^n<n<yi<ki< fci+i < a \ . 

m=2 Li=l J 

Thus the dimension of A^; seen as a vector space over R is 2 n+a . 

The algebra A^ over R can also be viewed as a right module algebra over the base ring Clif f n . We 
will use this fact later in subsection III-CI 

Example 2.8: Let us take n = 2, a = 1. Hence L = 2. Then 

^2 = {°i + 7i a 2 + <5i03 + 5i7ia 4 |ai,a 2 ,a3,a4 £ R} ■ 

Addition in the algebra is defined to be component wise and multiplication is completely described 
by defining the multiplication between any two basis elements. The multiplication table can be easily 
generated using the defining algebraic relations of the generators and is given as follows. 





1 


7i 


Si 


S1I1 


1 


1 


7i 


Si 


Siji 


7i 


71 


-1 


Sin 


-Si 


Si 


Si 


<5i7i 


1 


7i 


<5i7i 


<5i7i 


-*i 


7i 


-1 



One can check from the multiplication table that the multiplication is indeed associative. Note that Af 
can also be viewed as a vector space over C by viewing the symbol 71 as the complex number i = 
Then, we have A^ = {zi + b\Z2\z\, Z2 € C} where, z\ = a\ + 7102 and Z2 = 03 + 7104. 

From the defining relations of the generators of the extended Clifford Algebra, it can be observed that 
the symbols 1, 71, 72, . . . , 7 ra satisfy relations similar to that satisfied by the weight matrices that we 
need in the first row (squaring to —1 and anticommuting). Similarly, the symbols 5k, k = 1, . . . ,a, and 
Um=2 nUi Ski f° r 1 — k% < ki + \ < a satisfy relations similar to that satisfied by the weight matrices 
that we need in the first column (squaring to 1 and commuting with all other elements). Thus, for the 
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case of A = 2 a , when the weight matrices of any CUWD are listed down in the array form as shown in 
Table Jl the matrices in the first row will simply be matrix representations of the symbols 1, 71, 72, . . . , 
7 n of an extended Clifford Algebra. Similarly, the matrices in the first column are nothing but matrix 
representation of the symbols 5k, k = 1, . . . , a, and Um=2 112=1 ^fc f° r 1 — ^* — < a of an extended 
Clifford Algebra. 

C. Optimal Rate-ML decoding complexity tradeoff of Clifford Unitary Weight codes 

The maximum rate problem of CUWDs can be formally stated in many equivalent ways. Some of 
them are listed as follows. 

1) Given A and N t what is the maximum rate? 

2) Given g and N t what is the maximum rate? 

3) Given g and A what is the minimum value of iVy? 

For A = 2, the solution to the first question is reported in [12]. In this subsection, the solution to 
question number [3]) for A = 2 a , a G N is provided. Using the algebraic framework of extended Clifford 
algebras introduced in the previous subsection, the maximum rate problem can be restated in algebraic 
terms as follows. 

What is the minimum matrix size Nt in which the algebra A^_^ has a non-trivial matrix represen- 
tation ? 

This problem appears to be difficult to solve directly. Hence, we take an alternate approach which is 
similar to the approach in [2] wherein matrix representations of Clifford algebras were obtained using 
matrix representations of the Clifford group. First, we find a finite group with respect to multiplication 
in the algebra such that it contains the elements of the natural M-basis of denoted by 

^\ g -\y Then, we find a suitable representation of this finite group such that it can be extended to a 
representation of the algebra. 

Proposition 3: The set of elements G = ^\ g -\) U | — b\b £ ^(5-1)} * s a ^ n ^ ie group with respect to 
multiplication in Aj^ ^. Further, the group G is a direct product of its subgroups G 7 and Gg, where 

G 7 = %_ 1} U{-6|6 6 %-i)}, (g) 
G 5 = G Sl x Gs 2 x • • • x G Sa 

and G de ita z ={l,5i}, i = l,...,a. 

Proof: The multiplication in G is associative and the unit is 1. The inverse of the element db YiiLi 7fci 
is ±(— 1)1" 2"! ni^i Iki- The inverse of the element IX^i is itself. Similarly, it is easy to find the inverse 
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of the other elements. The set G 7 is nothing but the well known Clifford group [2]. The set Gs t is the 
cyclic group of order two (denoted by C 2 ) with generator Si. The set G$ is a group since it is the a times 
direct product of C 2 . The group G is a direct product of G 7 and Gg because: 

1) Each s£G can be written uniquely in the form s = S1S2 with si G G 7 and S2 £ G<5. 

2) For all si G G 7 and S2 £ G«5, we have S1S2 = s 2 Si. 

■ 

Thus the problem is simplified to finding the matrix representations of this finite group G. Towards 
that end, we quickly recall some basic concepts in linear representation of finite groups. We refer the 
readers to [65] for a formal introduction. 

Definition 10: [65] Let G be a finite group with identity element 1 and let V be a finite dimensional 
vector space over C. A linear representation of G in V is a group homomorphism p from G into the 
group GL(V). The dimension of V is called the degree of the representation. 

Few basic results in representation theory are as listed below. 
[Rl ]: Irreducible representations are representations with no invariant subspaces. 
[R2 ]: Every representation is a direct sum of irreducible representations. They are equivalent to block- 
diagonal representations, with irreducible representation matrices on the block diagonal. 
[R3 ]: Two representations R and R' of G are equivalent, if there exists a similarity transform U so 
that 

r\ x ) = U-^Ri&U, VieG 

[R4 ]: Unitary representations are representations in terms of unitary matrices 
[R5 ]: Every representation is equivalent to a unitary representation 
Theorem 4: [65] All the irreducible representations of an Abelian group have degree 1. 
Lemma 1: [65] Let p\ : G\ — ► GL{V\) and p2 : G2 — ► GL(V2) be linear representations of groups 
G\ and G2 in vector spaces V\ and V2 respectively. Then p\ % p2 is a linear representation of G\ x G2 
into Vi (8) V 2 . 
Theorem 5: [65] 

1) If pi and p2 are irreducible, then p\ <g) p2 is an irreducible representation of G\ x G2. 

2) Each irreducible representation of G\ x G2 is equivalent to a representation p\ % p2, where pi is 
an irreducible representation of Gi, i = 1,2. 

Now, having introduced the necessary tools, the problem is to find unitary matrix representations of the 
finite group G. Before we proceed, note that when G is interpreted as a finite group, the representation 
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of —1 does not necessarily have anything to do with —1 times identity matrix and similarly for a generic 
—b, b G SS^g^y Such a representation p, where p{— 1) ^ — p(l) is said to be a degenerate representation. 
Degenerate representations are not representations of the algebra A^_^. Thus we are interested in the 
smallest degree non-degenerate unitary representation p of the finite group G such that the representation 
matrices of the required elements of G are linearly independent over R. The following lemma will help 
to prove the linear independence of complex matrices over R. 

Lemma 2: A set of complex matrices A;,i = 1, ... ,K 6 £TxN T are ii ne arly independent over R if 
Tr(Ai H Aj + Aj H A ; ) = 0, V i + j. 

Proof: The linear map from C Tx Nt x C TxNt i-> R given by iTr(A H B+B H A) for A, B <G C TxNt 
is an inner product. The statement of the lemma then follows. ■ 

Theorem 6: The maximum rate of a CUWD for A = 2 a , a 6 N and arbitrary g is equal to ^g-v } 
dpcu. 

Proof: Proof is by induction on a. The proof proceeds to find the smallest degree non-degenerate 
unitary representation p of G such that the following condition is satisfied. 



p(x)^±p(y), \/ x^yeG (9) 

The above condition is required, since otherwise, the representation matrices will be linearly dependent 
over R. However, even if the above condition is satisfied, linear independence is still not guaranteed. 
Therefore, we can only obtain an upper bound on the rate but we shall see that a representation meeting 
the upper bound actually provides us with linear independence as well. 

For a = 0, CUWDs become ODs and the maximum rate for square ODs is well known [2] and the 
theorem holds true. For a = 1, A = 2 and the group G = G 7 x G$ 1 where G$ 1 = {1, <5i}. Since we are 
interested in the smallest degree representation of G, let us first study the irreducible representations of 
G. From Theorem [5J all irreducible representations of G are obtained as a tensor product of irreducible 
representations of G 7 and G^. All irreducible representations of G 7 have been studied in [2]. There are 
2 non-degenerate irreducible representations of G 7 in dimension 2 L" 2 ^ J . The representation matrices of 
the (g — 1) generators of G 7 are given as follows [2]: 
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R{l2k) 
R{l2k) 



(7 3 <g> • • • ® (7 3 , = 1,2, . 

, ' 

k-1 

I2 8) • • • <8> I2 <8>0-2<g> (T 3 (g) • • • (g) (j 3 , A; = 1,2, . 




if - 1 - k 

dbi cr 3 (8) • • • (8) cr 3 

v v ' 

if - 1 



fc-1 





1 




i 




1 




where, ui = 


-1 




i 


, cr 3 = icricr 2 = 


-1 


and if = | 



I if g is even 
if <? is odd 

The notation R(.) is used to denote the representation matrix. Also note that Tr(<7j) = 0, i = 1,2,3. 

Since the two non-degenerate representations are in the same dimension, without loss of generality, 
let us consider one of them and denote it by p$. By Theorem 01 all the irreducible representations of 
Gs 1 are in dimension 1 since the group Gs 1 is Abelian. Recall that G$ 1 is nothing but the cyclic group 
C2 of order two. Apart from the trivial representation (all elements are mapped to 1), the only other 
irreducible representation of the order two cyclic group Gs 1 is given by: R(l) = 1, R(5\) = —1. Note that 
(— l) 2 = 1 and hence —1 is the generator. Thus we get two non-degenerate irreducible representations 
of G in dimension 2 J denoted by Ri and R2 respectively and they are given by: 

1) Ri(li) = po(li),i = 1, ■■■,(9 ~ !)> Ri(Si) = Im, Ri(Sili) = Po{ji),i = l,...,(g- 1) 

2) #2(7*) = Po(li),i = !)••• >(fl - 1). ^2(^1) = -I m , R2{8ili) = -Po{li),i = 1, ••• ,{9- 1) 
where, m = 2L fl i _ J. But both the non-degenerate irreducible representations of G fail to satisfy condition 
©. Thus we seek non-degenerate reducible representations of G that satisfy (O. From property [R2], 
we have that reducible representations can be easily obtained by placing irreducible representations as 
blocks on the diagonal. If degenerate irreducible representations are placed as blocks on the diagonal then 
it is easy to check that the resulting representation will also be degenerate. Thus only non-degenerate 
irreducible representations can be placed as blocks on the diagonal to construct non-degenerate reducible 
representations of G. It then follows that the smallest degree non-degenerate representation p% satisfying 
© for a = 1 is 2(2 L j2L ^ — J ) and the corresponding basis matrices we need are explicitly given as follows: 



Ai — l2m, A 2 i+1 



Poili) 
po(li 



1,2,. . .,(<?-!), A 2 







Now using the identity Tr(A <g) B) = Tr(A) x Tr(B), it can be easily checked that the above basis 
matrices are trace orthogonal, i.e., Tr(A^Aj + A^Aj) =0, V i 7^ j and hence by Lemma [2] they are 
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linearly independent over R. Thus the theorem is true for a = 1. Now let us assume that the theorem is 
true for a = n — 1 and prove the theorem for a = n. 



For the case of a = n, note that the corresponding G can be expressed as G = G n _i x Gg n where, 
G n _i = G 7 x x G# 2 x • • • x G$ n _ 1 and G5. = {1, Si} , i = 1, . . . , n. Once again invoking Theorem 
|5J we have that the irreducible representations of G are a tensor product of irreducible representations 
of G n _i and Gg n . Now using Theorem [4] the non-degenerate irreducible representations of G are in 
dimension 2 L 2 J. Since they do not satisfy ([9]), we look for non-degenerate reducible representations 
whose degree has to be a multiple of 2 L j2 ^ — J . By induction hypothesis, the smallest degree non-degenerate 
representation which results in linearly independent basis matrices for a = n — 1 is 2 n - 1 (2L a r L J). Let 
it be denoted by p n -\. Since the representation p n -\ is also a representation of G n _i, using analogous 
arguments as made for a = 1 it follows that the smallest degree non-degenerate representation p n 
satisfying (J9]) for a = n is in dimension 2 n (2'-"V J 'J) and the corresponding basis matrices are given by: 



A; 



Pn-lU) 

p n -l(l 
/?n-l(<5*-l) 

p n _i(5i_i 



i+l 



Pn-lili) 

Pn-lili) 

...,(n-l),A n = 



* = !)••• ,(fl- 1). 

Pn-l(l) 

-Pn-l(l) 



Once again it can be shown that the above basis matrices are linearly independent over R by using 
Lemma [2] ■ 



Theorem [6] essentially answers the question: For a CUWD, given g and A, a power of two, what is 
the minimum matrix size Nt that it can have? The answer to this question is given by A ^V~H^. The 
following example highlights the fact that the maximum rate expression of a CUWD given in Theorem 
[6] does not depend on A. 



Example 2.9: For g = 4, let us study CUWDs for two cases A = 1 and A = 2. 
Case 1: A = 1, g = 4 

The minimum possible dimension in which a CUWD with these parameters exists is given by Theorem 
6 which is equal to 2. The corresponding CUWD is nothing but the well known Alamouti LSTD. 
Case 2: A = 2, g = 4 

The minimum possible dimension in which a CUWD with A = 2, g = 4 exists as per Theorem 6 is 4 
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and the corresponding CUWD is given by: 

X1+X2 + i(x 3 + £4) —X5 - x 6 + i(x 7 + x$) 

x 5 + x 6 + i(x 7 + x 8 ) xi + x 2 - i(x 3 + £4) 

xi - £ 2 + i( x 3 - xa) -x 5 + x 6 + i(x 7 - x$) 

£5 — £6 + *(^7 — 278) Xl — £2 — i(x3 ~ £4) 

where, the grouping of real variables are {xi,X2}, {£3,3:4}, {£5, £6} and {£7,£s}- Note that when A 
increases from one to two, the minimum dimension in which K = Xg matrices with required properties 
exist, also increases. In fact, Theorem [6] explicitly tells that Nt increases linearly with A. This makes 
the rate, which is independent of A. 

1 ) Algebraic description for ABBA construction: 
As stated in Section [III] (also see Example 12.81 ), the algebra over E can also be viewed as a finitely 
generated right module algebra over Cliff n . A general element x of the algebra A^ can be written as 
follows: 

a 

x = ci + 5ic 2 H h 5 a c a+ i + 5i5 2 c a+2 H h (JJ 5 l )c L (10) 

i=i 

where Cj,i = 1, . . . , L £ Cliff n . There is a natural embedding of A^; into Endcuf f„(&n) gi ven by 
left multiplication as shown below: 

<t):A^EndciiffM), 
4>(x) = L x : y ^ xy. 

It is easy to check that the map L x is Cliff n linear and the map is a ring homomorphism. Also, it 
can be proved that the map <p is injective as follows. 

Let 4>{x\) = L Xl and 4>{x2) = L X2 . If <j){x\) = 4>{x2), then 

L Xl (y) = L x 2 {y)Vy 
x\y = x 2 yM y 
(xi - x 2 )y = V y 

which implies £1 — x 2 = or equivalently £1 = x 2 . Hence, we can represent the algebra A^ by matrices 
with entries from Clifford algebra. However, we are only interested in matrix representations with entries 
from the complex field. But this can be easily obtained by simply replacing each Clifford algebra element 
by its matrix representation over C. This is possible because the matrix representation of Cliff n over C 
is well known and is explicitly given in [2]. The resulting weight matrices are guaranteed to be linearly 
independent since <fi is injective. We now illustrate this construction with an example. 
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(13) 



Example 2.10: Consider Ajf for a = 2. Thus A = 4 , g = n + 1 and K = 4(n + 1). A general element 
x G A^j can be expressed as follows. 

x = c\ + Sic 2 + 6 2 c 3 + <5i<5 2 C4 

where, Cj, i = 1, . . . , 4 € Cliff n . Let us obtain a matrix representation over Clif f n for the map L x . We 
have, 

-Ml) = Ci + 5iC 2 + 5 2 C 3 + £l<$2C4 

L x {5i) = (ci + <5ie 2 + 6 2 c 3 + 616204)6! 

= Jici + c 2 + 5i5 2 c 3 + 5 2 c 4 

L x (5 2 ) = (ci + (5ic 2 + <5 2 c 3 + (5i(5 2 c 4 )5 2 (12) 

= 5 2 ci + 5i5 2 c 2 + C3 + <5iC4 

^(^1^2) = (01 + 6102 + 6203 + 616204)6x62 

= 6162C1 + 62C2 + 61C3 + C4. 
The map L x can be represented by the matrix 

Cl C 2 C 3 C 4 

C 2 Cl c 4 c 3 
C3 C 4 Cl c 2 
c 4 c 3 C 2 Cl 

where, ci,c 2 ,C3,c 4 S Cliff n . In order to get a matrix representation over C, we simply replace each 
Ci,i = 1, . . . , 4 by their matrix representations over C. However, we are interested only in a 4-group ML 
decodable LSTD which can be obtained by using the matrix representation of the specific elements 1, 
7i, i = 1,... ,n, 61, 6iji, i = 1, . . . , n, 6 2 , 6 2 ji, i = l,...,n, 6i6 2 , 6i6 2 ji, i = 1, . . . , n as weight 
matrices. This is done by restricting the representation of the algebra to the subspace over R spanned by 
the required elements of the algebra. In other words, we substitute zero for the coefficients corresponding 
to the terms not required (terms involving product of Clifford algebra generators like 71 7 2 are omitted). 
To be precise, each C{ G Clif f n is restricted to be of the form: 

n 

Ci = £(i-i)(„+i)+i +^x(j_ 1 ) (n+ i) +i 7j 
i=2 

for some ijEl, i = 1, . . . , K by forcing the coefficients of the remaining terms as zero. In terms of 
the corresponding matrix representation, this is equivalent to simply replacing Cj, i = 1, . . . , 4 by ODs in 
( fT3l ). Therefore, the above method results in a 4-real symbol ML decodable CUW STBC with maximal 
rate. It turns out that the above construction is precisely the ABBA construction proposed by Tirkkonen 
et al in [5]. 



DRAFT 



November 22, 2008 



27 



As a consequence of this result, it follows that the 4 transmit antenna LSTD based on ABBA construc- 

x\ + 1x2 —X3 + 1x4 £5 + ixq —X7 + ixg 

X3 + 1x4 x\ — 1x2 X7 + ix$ X5 — ixe 

X5 + IXQ —X7 + IX$ X\ + 1X2 —X3 + «X4 



tion given by 



has to be 2-real symbol ML decodable. 



x-j + ix$ x§ — ixq X3 + 1x4 x\ — 1x2 
Though the same LSTD was proposed earlier in [5], the authors of [5] chose the following pairing of 

real variables in a group which essentially resulted in a 4-real symbol ML decodable STBC. 

1) First group {x±,X2} 

2) Second group {2:3, 24} 

3) Third group {25, x%\ 

4) Fourth group {x-j,xg}. 

However, if we form the following partition of real variables, we can obtain a single complex symbol 
ML decodable STBC. 

1) First group {xi,xs} 

2) Second group {x2, xq} 

3) Third group {x3, X7} 

4) Fourth group {x4,x%}. 

Thus we see that the ML decoding complexity of STBCs obtained from linear designs can vary 
dramatically depending on the choice of multidimensional signal set £f. 

2 ) Algebraic description for Tensor product construction: 
In [13], a construction of CUW STBCs based on tensor products was provided without giving any 
reasoning for the mathematical source of such a construction. With the algebraic background that we 
have now developed, the tensor product construction in [13] can be easily explained. Since the group 
G is a direct product of G 7 and G$, from Lemma [T] a representation of G can be obtained as a tensor 
product of a representation of G 7 and that of G$- Unitary matrix representations of G 7 are available 
in [2]. The unitary matrices representing G$ should commute and also square to /. Such matrices are 
simultaneously diagonalizable and their eigen values are equal to ±1 (squaring to I). So a simple method 
to construct A-real symbol decodable STBCs would be to take A linearly independent diagonal matrices 
of size A x A having ±1 as entries and then tensor them with representation matrices of the generators 
of Gj. The construction suggested in [13] is precisely based on this principle. One advantage of the 
construction in [14] is that it provides CUWDs for all even number of transmit antennas. 
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3) On the maximal rate of non-CUW STBCs: 
It is important to note that Theorem [6] provides the optimal rate-ML decoding complexity tradeoff only 
within the class of CUW STBCs. The rate of a general g-group ML decodable STBC can in fact be more 
that of a CUWD. An example of a such a LSTD is the recently found high rate quasi-orthogonal STBC 
in [58]. This LSTD for 4 transmit antennas which was found by an exhaustive computer search has a 
rate of 2.5 dpcu and is 2-group ML decodable. This solitary example for 4 transmit antennas shows that 
there is a lot of room for further work in the direction of increasing transmission rate of g-group ML 
decodable STBCs. 

III. Multi-group ML decodable Distributed STBCs 

In this section, multi-group ML decodable distributed space time block codes (DSTBCs) are discussed. 
In Subsection IIII-AI we present a generalization of the distributed space-time coding strategy proposed in 
[33] and derive a code design criteria for full diversity. In Subsection IIII-BI the necessary and sufficient 
conditions for multi-group ML decoding of DSTBCs are provided. Three new classes of four group 
decodable DSTBCs are constructed in Subsection IIII-CI 

A. Distributed Space-Time Coding 

Consider a network consisting of a source node, a destination node and R relay nodes which aid 
the source in communicating information to the destination. All the nodes are assumed to be equipped 
only with a single antenna and are half duplex constrained, i.e., a node cannot transmit and receive 
simultaneously in the same frequency. The wireless channels between the terminals are assumed to quasi- 
static and flat fading. The channel fading gains from the source to the i-th relay, f\ and from the i-th 
relay to the destination are all assumed to be independent and identically distributed complex Gaussian 
random variables with zero mean and unit variance. Symbol synchronization and carrier frequency 
synchronization are assumed among all the nodes. Moreover the destination is assumed to have perfect 
knowledge of all the channel fading gains. 

Every transmission cycle from the source to the destination comprises of two phases-broadcast phase 
and cooperation phase. In the broadcast phase, the source transmits a T(T > R) length vector \J-k\Pt, 
which the relays receive. Here, P denotes the total average power spent by all the relays and the source. 
The fraction of total power P spent by the source is denoted by The vector z satisfies E[z^z] = T 
and represents the information that the source intends to communicate. The received vector at the j- 
th relay node is then given by rj = ^iriPfjZ + vj, where vj ~ CM(0,It)- During the cooperation 
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phase, all the relay nodes are scheduled to transmit together. The relays are allowed to only linearly 
process the received vector rj or its conjugate rj*. To be precise, the j'-th relay node is equipped with 
a T x T matrix Bj (called relay matrix) satisfying 
t 



Bj \\p= T and it transmits tj 



7T1.F+1 J J 



i j \/ ^^"p^ Bjrj*. Here, 7T2 denotes the fraction of total power P spent by a relay. Without loss of 
generality, we may assume that the first M relays linearly process rj and the remaining R — M relays 
linearly process rj*. If the quasi-static duration of the channel is much greater than 2T time slots, then 

the received vector at the destination is given by 

R 

y = D#tj + w = 



7T17T2-P 2 



3=1 



TTlP+1 



Xh + n 



(14) 



where, 



X 
h 



Biz 



Bmz Bm+iz* 



Brz* 



n 



flQl f292 ■ ■ ■ fM9M f*M+l9M+l 
r, i M R 



7T 1 P + 



f*R9R 
+ w, 



(15) 
(16) 



T E* B J v j+ E * B jVj* I +w, (17) 

\j=l j=M+l 

and w ~ CN(0, It) represents the additive noise at the destination. The power allocation factors m and 



7T2 must satisfy tt\PT + R-K2PT = P(2T). Throughout this paper, we choose tt\ = 1 and 1T2 = j=> 
suggested in [33]. Let Y denote the co variance matrix of n. We have, 

vr 2 P 



as 



R 



T = E[nn^] = I T + 



T Q>| 2 B iBi *). 



(18) 



i=i 



1T 1 P + 

The vector z transmitted by the source is taken from a finite subset of C T which then defines a 
collection of matrices when substituted for in X as given in (fT5l ). This finite set of matrices is called 
a DSTBC since each column of a codeword matrix is transmitted by geographically distributed relay 
nodes. The destination node performs ML decoding as follows: 



X 



are; mm 



™ p2 -xh) HI 



(19) 



w VttiP + 1 

Observe that if the entries of z are treated as complex variables, then the DSTBC can be viewed 
as being obtained from certain special LSTDs having the form of ([TBI ). Note that such LSTDs have the 
property that any column has linear functions of either only the complex variables or only their conjugates 
respectively. We refer to LSTDs with this property as 'conjugate LSTDs'. The following theorem provides 
sufficient conditions under which the DSTBC ^ achieves full cooperative diversity equal to R under ML 
decoding. 
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Theorem 7: Assume that T > R, iri = 1 and tt2 = ^. If BiBi H is a diagonal matrix V i = 1, . . . , R 
and if AX = X ; - Xj has full rank for all pairs of distinct codewords X i; Xj G tf, then the DSTBC ^ 
achieves full cooperative diversity equal to R under ML decoding. 

Proof: Let Xi be the transmitted codeword, Xj be some other codeword and let AX = Xi — Xj. 
We have, 



P(y|Xi) 



- (y- V ^xh)«r- (y - yj ^xh) 



vr T |r| 

On applying the Chernoff bound, we can upper bound the pairwise error probability (PEP) that a ML 
decoder decodes wrongly to Xj as follows: 

PEP < E e -^ J h-(Ax)-r- 1( Ax)h_ (2Q) 
{fi}{9i} 

Since BiB^ is a diagonal matrix Vi = 1, . . . , R, let /i denote the maximum of the diagonal entries 
of BiBf 1 over all i = 1, . . . , R. Let D = (l + J \ gi \^ i T . Now by replacing T by D, we 

can further upper bound the PEP expression in (1201 ) since this is essentially equivalent to assuming more 
noise variance at the destination than what is actually present and hence results in an upper bound. Thus, 
we have 

p E p< E e -i^jh«(AX)«D-(AX)h 

~ !/,}{.'/.} 

On integrating over the /j's as done in Appendix I of [33], we get 

PEP< E |I R + /^^(AX)^D- 1 (AX)diag{| 9l | 2 ,b 2 | 2 ,...,| 5R | 2 }r 1 . 
{gi} 4(^+1) 

For the power allocation iri = 1, 7T2 = ^ and for large P, we can approximate the above expression 
as follows: 



E P 

PEP < |Ir + — . (AX) g (AX)diag{|g 1 | 2 , \g 2 \ 2 , \g R \ 2 } \~\ (21) 

{9i} 4(i? + /i£*iM 2 ) 

Now proceeding as in Appendix II of [33], it can be shown that the above expectation can be further 

upper bounded to result in: 
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/ C |(AX)g(AX)|* V* p -R(i- W ) 

where, c is some constant independent of P. This completes the proof. ■ 
Theorem |7] generalizes the results of [33] (wherein only unitary relay matrices were permitted) to allow 
row orthogonal relay matrices (BiBi^ is a diagonal matrix). Note that the transmission protocol assumed 
in this paper does not involve communication using the direct link between the source and the destination. 
An even more general transmission protocol called as 'GNAF protocol' which employs the direct link 
and also allows a general form of linear processing at the relays along with unequal duration of broadcast 
phase and cooperation phase is discussed in [49]. Note that if the direct link is also employed, then a 
maximum diversity of R+ 1 can be achieved [49]. However, for the purposes of this paper, the results of 
Theorem [7] are sufficient. We shall see in the sequel that relaxing Bi to row orthogonal matrices paves 
the way to obtain DSTBCs with low ML decoding complexity. Hence, for constructing DSTBCs we 
need conjugate LSTDs whose relay matrices have orthogonal rows. This is one of the major differences 
between collocated STBCs and DSTBCs. 

B. Conditions for Multi-group ML decoding of DSTBCs 

The ML decoding complexity of DSTBCs becomes an important issue especially when R is large. This 
provides a good motivation to study multi-group decodable DSTBCs. The following theorem provides 
necessary and sufficient conditions for multi-group ML decoding of DSTBCs. 

Theorem 8: A DSTBC ^ is g-group decodable if and only if the following two conditions are satisfied. 

1) is g-group encodable 

2) The associated basic matrices Ai, i = 1, . . . , K of "io satisfy: 

APr^ 1 Aj + Af T x Ai = (22) 
whenever Ai and Aj belong to different groups. 

Proof: Let y = T 2y. Then from (IT9T ). the ML decoding metric is given by 
II y ~~ y^ ttiP+i (r -i X)h 11^- Compared to the collocated case, the difference here is the term involving 
r 2 . The effect of pre-multiplication by T 2 can be captured by considering T 2 X as a LSTD whose 
basis matrices are given by T~2 A}, i = 1, . . . , K. Now applying the conditions for g-group ML decoding 
of collocated STBCs, we get the condition for g-group ML decoding of DSTBCs to be that: (1) ^ should 
be 5-group encodable and (2) whenever Ai and Aj belong to different groups, they should satisfy 

(r-Ui)"(r-Uj) + (r-iAj) H (r-3Ai) = o 
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which, on simplification gives (1221) . ■ 
Note from ( fl~8l ) that if all the relay matrices are restricted to be unitary as in [33], then F becomes a 
scaled identity matrix which in turn makes the condition in (1221 coincide with that for collocated STBCs. 
To summarize, a g-group ML decodable collocated STBC qualifies to become a g-group ML decodable 
DSTBC if it satisfies the below three conditions. 

1) The associated LSTD X = ~}2d=i x i&-'\ is a conjugate LSTD. 

2) The associated relay matrices are row orthogonal, i.e., BiB^ is a diagonal matrix. 

3) Equation (21) is satisfied by the associated basis matrices Aj, i = 1, . . . , K. 

Example 3.1: Consider the 4x4 single real symbol ML decodable (6-group ML decodable ) STBC 



from 4x4 orthogonal design given by 



Z\ 


— 7* 


— 7* 







* 

z l 







zz 





Zl 


-Z 2 





Z3 


7* 

"2 


Zl 



. Note that it is not a conjugate LSTD 



and hence does not qualify as a DSTBC. 
Example 13.11 demonstrates that though orthogonal designs and hence single real symbol ML decodable 
collocated STBCs are well known in the literature, the transition to distributed case is not straightforward. 
Thus it is clear that it is more difficult and challenging to construct multi-group ML decodable DSTBCs 
compared to multi-group ML decodable collocated STBCs. 



C. Four group decodable DSTBCs from Precoded CIODs 



Towards constructing four-group decodable DSTBCs, consider the following example. 
Example 3.2: Consider the 4x4 CIOD [11] shown below 



X 



CIOD 



V2 





-4 








Z2 


z\ 














z-i 


* 

-z\ 








2 4 


7* 

^3 



where, z\ = x\ + 1x2, zi = X3 + 2x4, Z3 = X5 + ixq and 24 = x-j + ix$ are complex variables. It is 
clear that ~Kciod is a conjugate LSTD. We shall now see how ~Kciod is actually a 4-group decodable 
DSTBC. Let the number of relays R = 4. In the broadcast phase let the source transmit the vector 

l T 



Zl Z2 Z 3 Z4 



, where the information symbols {xi,x$}, {x2,xq}, {3:3,3:7}, {x^,xs} are 
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each taken from a rotated QAM constellation as given below 







cos 9 


— sin 9 




yu 


Xi+A 




sin# 


cos 9 




ViQ 



where, 



yu 

ViQ 



1, ... 4, take values from a QAM constellation and 9 is an appropriately chosen 



rotation angle [11] so that the resulting DSTBC satisfies the rank criterion for full diversity according to 
Theorem [7] For X-ciod, the value of M = 2 and the corresponding 4 relay matrices are: 



Bi 



B, 



The corresponding matrix T is given by 

vr 2 P 



V2 













-V2 








V2 







V2 












,B 2 = 




























































































, B4 = 











V2 













-V2 








V2 







V2 






I4 + 



vriP+ 1 

The weight matrices for Xc/od are given as follows: 



2(m 2 + m 2 )/ 2 

2(| 53 | 2 + M 2 )/ 2 



Ai = V2 



A 4 = V2 



A 7 = V2 



10 

10 





i 

1 




-1 
10 



V2 



V2 



,A 8 = V2 



i 
-i 




10 
1 


i 
i 



V2 



V2 





1 









i 





-10 







— i 



November 22, 2008 



DRAFT 



34 



It is easy to check that all the weight matrices are row orthogonal and they satisfy (1221 ) for A = 1. 
This is because of the special block diagonal structure of ~Kciod with each block being a replica of 
the Alamouti LSTD. The resulting DSTBC will achieve full cooperative diversity and is 4-group ML 
decodable or equivalently one complex symbol ML decodable. 

We now generalize the LSTD ~Kciod given in Example 13.21 for any number of relays having the 
special feature of 4-group decodability. We call these LSTDs as 'Precoded CIODs' (PCIODs). 
Construction of Precoded CIOD for even number of relays: 
Given R an even number, the R x R PCIOD ~X.pciod for R relays is given by (l23l . 

„ ,. I xx+ix-2 -x 3 +ixi x k + jx k + 1 -x k + 2 +jx k + a %2R- 3 + ix 2R - 2 ~x 2 R- 1 + ix 2 R I 

I x 3 + lx t x x - IX 2 x k + 2 +jx k + 3 x k - jx k+1 X 2R _ 1 +IX 2R X2R-3 - »I2H-2 I 

(23) 

There are totally 2R real variables X\,X2, ■ ■ ■ ,X2R in the conjugate LSTD Xpciod- The following 
expression shows that X.pciod is not fully diverse for arbitrary signal sets. 

\(AX PCI od) H (AX PC iod)\= (j2 Ax ") ...(X>^ 2 ) •••( 

\i=l J \i=k ) \i 

However, constellation precoding can be done to achieve full diversity. Precoding is to be done in the 

following manner. The 2R real variables are first partitioned into 4 groups as follows: 
. First group: {x 1+4k \k = 0, 1, . . . , 

• Second group: {x 2+ 4k\k = 0, 1, . . . , ^f^} 
. Third group: {x 3+4k \k = 0,1,..., 

• Fourth group: {x i+Ak \k = 0,1,..., ^f^}. 

There are ^ real variables in each group. Now let X}, i = 1, . . . , 4 denote the LSTDs corresponding to 
only the real variables in the i-th group respectively. Now ~Kpciod = Ylt=i -^-i- ^ so ^ can De checked 
that 

4 

(AXpcjod) 11 (AXpcwd) = ^ AXf AX,. 

i=i 

Supposing the constituent STBCs corresponding to LSTDs Xj, i = 1, ... ,4 are fully diverse, then 
|(AXj)' ff (AX i )| > V i = 1, ... ,4 and on application of Corollary 4.3.3 in [63], we get 




\(AXpciod) H (AX P ciod)\ > .minjKAXf )(AX)|} . 

Thus we see that if the constituent STBCs are fully diverse, then the resulting STBC from PCIOD 

——l 

will also be fully diverse. Note that |(AXj) H (AXj)| = (]l/=o Ax i+4j ) 2 which is nothing but the 
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product distance. Hence, if we let the ^ real variables in a group to take values from a rotated Z~ 
lattice constellation which is designed to maximize the minimum product distance then full diversity is 
guaranteed. Algebraic number theory provides effective means to construct rotated Z n lattices with large 
minimum product distance [47], [59] for any n G N and the corresponding lattice generator matrices can 
be explicitly obtained from [59] for dimensions upto 30. Due to the block diagonal nature of X.pciod 
with replicas of Alamouti designs on the blocks, the resulting DSTBC will be a full diversity 4-group 
decodable DSTBC. The following example illustrates the construction procedure for R = 6. 
Example 3.3: The PCIOD for 6 relays is as shown below: 



diag 
where, 



X\ + ZX2 —X3 + 1X4 
£3 + 2X4 X\ — 1X2 



x 5 + ix e -x 7 + ix 8 
x-j + ixs x§ — ixQ 



x 9 + ix 10 -xn + ixu 
xio + ix u x 9 - ix 10 







Vi 




Xi+4 


= g 


Vi+i 


,i = 1.....4 


Xi+8 




Vi+8 





and the vectors 



Vi 

Vi+8 



,1 



1, 2, 3, 4 take values from a subset of I?. The 3x3 lattice generator matrix 



Q can be taken from [59]. At the destination, the ML decoding of the real variables {xi, Xj+4, Xi+$} has 
to be done jointly for each i = 1,2,3,4 separately. Thus the resulting DSTBC is 4-group decodable or 
3-real symbol decodable. 

Construction of Precoded CIOD for odd number of relays: 

If R is odd, then construct a PCIOD for R + 1 relays and drop the last column to get a (R + 1) X R 
LSTD. For example, a single complex symbol decodable code for 3 relays can be obtained from Example 
by dropping the last column. This is shown in the following example. 
Example 3.4: 

x\ + ix2 X3 + ix^ 
— X3 + 1x4 X\ — ix 2 

X5 + ixq 
— X7 + ix$ 
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1 ) Encoding complexity at the relays for PCIODs: 
By observing the structure of the relay matrices of PCIOD one can see that it has zeros everywhere except 

for a single non-zero 2x2 sub-matrix which is a scaled version of either identity matrix or 



-1 

1 



Thus having received two complex numbers, say 
and transmitting one of the following 



a + ib 
c + id 



a + ib 




—a + ib 




or 




c + id 




c — id 



, a relay should be capable of generating 
both of which require significantly 



less complexity as compared to multiplying the received vector by an arbitrary complex matrix. 

2) Resistance to relay node failures: 
Note that any two columns of the PCIOD are orthogonal. This leads to the property that even if any 
column of the design is dropped, it continues to satisfy the full rank condition. This property is important 
since even if certain relay nodes fail, which is equivalent to dropping few columns of the LSTD, the 
residual diversity benefits are still guaranteed and that too with no additional increase in ML decoding 
complexity. 

Thus we have a constructed a class of 4-group decodable DSTBCs for any number of relays having 
the following salient features: 

1) Transmission rate of the source is 0.5 complex symbols per channel use 

2) Full diversity 

3) Four group ML decodable 

4) Low encoding complexity at the relays 

5) Resistance to relay node failures 



D. Four group decodable DSTBCs from extended Clifford algebras 

In the previous subsection, a class of 4-group decodable DSTBCs was constructed for arbitrary number 
of relays from PCIODs. Amidst many advantages, PCIODs do have a drawback that the power distribution 
among the relays is not uniform across time slots which is due to the large number of zeros in the LSTD. 
This leads to a large peak to average power ratio (PAPR) problem at the relays which is undesirable 
since it demands the use of larger power amplifiers at the relays. Moreover since the relay matrices 
of PCIODs are not unitary, this forces the destination to perform additional processing to make the 
noise covariance matrix a scaled identity matrix, i.e., pre-multiplying the received vector by . Above 
all, the construction of PCIODs was not obtained from a systematic algebraic procedure targeting the 



DRAFT 



November 22, 2008 



37 



requirements for 4-group decodable DSTBCs. Hence it is natural to ask whether there exists a systematic 
algebraic construction of 4-group decodable DSTBCs with unitary relay matrices and uniform power 
distribution across the relays and in time. 

In this subsection, using the algebraic framework of extended Clifford algebras introduced in Subsection 
III-BL two new classes of 4-group decodable DSTBCs with unitary relay matrices as well as unitary weight 
matrices for power of two number of relays are constructed. 

As discussed in Subsection III-BI to construct 4-group decodable DSTBCs, we need 4 matrices (in- 
cluding identity matrix) in the first row (as shown in Table H]). One way to obtain such matrices is to take 
the matrix representation of A3 for L = 2 a , a G N. The matrix representation of the symbols 1 , 71 , 72 , 73 
respectively can be used to fill up the first row. Interestingly there is yet another way of obtaining such 
matrices. Let us look at A2 for L = 2 a , a G N. The symbols 71 and 72 square to —1 and anti-commute. 
However, note that 

(7271 ? = -1 
(7271)71 = -7i(727i) • ( 24 ) 
(7271)72 = -72(7271) 
Thus the symbol 7271 also squares to —1 and anti-commutes with the symbols 71 and 72. Thus we can 
fill up the first row with the matrix representations of the symbols 1,71,72,7271 respectively. Thus we 
can get two classes of 4-group decodable DSTBCs, one from A3 and the other from A2 if the problem 
of conjugate linearity property and unitary relay matrices are also taken care. 

1 ) Matrix Representation: 
There are several ways to obtain a matrix representation of an algebra. We need to take an appropriate 
matrix representation such that the following conditions are met. 

1) The symbols 1, 71, 72, . . . , 7„, 5 k , for k = 1, . . . , a, and Um=2 U4L1 S k t for 1 < h < k i+1 < a 
should be represented by unitary matrices. 

2) The resulting LSTD should be a conjugate LSTD. 

3) All the relay matrices should be unitary. 

Such matrices are naturally provided by the left regular representation of the associative algebra A^. 
Left regular representation is an easy way to obtain the matrix representation for any finite dimensional 
associative algebra [64]. Such techniques have been previously used in [9], [10] to obtain the matrix 
representation of division algebras and crossed product algebras. The first requirement of unitary matrix 
representation is met because the natural basis elements of A^; over R together with their negatives form 
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a finite group under multiplication (see Proposition [3). This fact in conjunction with the properties of 
left regular representation guarantee a unitary matrix representation for the required symbols. We shall 
prove the other properties after illustrating the construction procedure for both the codes from and 

A 3 L 

2) DSTBCsfrom : 

We first view A2 as a vector space over C by thinking of 71 as the complex number i = yj—l. A natural 
C basis for A2 is given by 



Thus the dimension of Ag seen as a vector space over C is 2 n+a 1 . We have a natural map from A2 to 
Endc(A2 ) given by left multiplication [9], [10], [64] as shown below. 



Since the map L x is C linear, we can write down a matrix representation of L x with respect to the 
natural C basis B^. Thus we obtain a LSTD satisfying the requirements of ([3]) for g = 4. 

Example 3.5: Let us begin with the simplest case of R = 2 1 relays. Let n = 2. Then equating 
n + a — 1 = 1, we get a = and hence L = 1. But the algebra Ag is same as Cliff (2) which is nothing 
but the Hamiltonian Quaternions H. It is well known [9] that the left regular matrix representation of EI 
yields the popular Alamouti design. Thus we see that our algebraic code construction which was driven 
by the need for low ML decoding complexity naturally leads to the Alamouti design. 

Example 3.6: Suppose we want a LSTD for R = 8 = 2 3 relays. Let n = 2. Then we need n+a— 1 = 3. 
Thus a = 2 and L = 4. A general element of the algebra A| looks like 



where, z,- € C 3 Vi = 1, . . . , 8. We shall now find the matrix representation of L x by finding out the image 
of the basis B\ under the map L x which is shown below. 




(25) 



(f> : A2 1— > Endc(A^) 
4>(x) = L x : y ^ xy 



(26) 



X = Z\ + 5\Z2 + 52Z 3 + <5i<5 2 £4 + 72-^5 + (5l72^6 + ^272-27 + <5l<5 2 72' z 8 



DRAFT 



November 22, 2008 



39 



L x (l) 
L x {5x) 
L X (S 2 ) 

^(#172) 
L x (5i52j2) 



Z\ + 5\Z 2 + #2-23 + #l#2-24 + 72-25 + #172-26 + #272-27 + #1#272<28 
5\Zi + Z 2 + #l#2-23 + #224 + #172-25 + 72-26 + #l#272-27 + #272-28 
5 2 Zi + 5i5 2 Z2 + Z 3 + 81Z4 + #272-25 + #l#2 72-26 + 72-27 + #172-28 
#2#2-2l + #2-22 + #1-23 + -24 + #l#2 72-25 + #2 72-26 + #172-27 + 72-28 
(z 1 + #1^2 + #2^3 + #1#2^4 + 72^5 + #172^6 + #272^7 + #l#272Zs) 72 



72-21 + #17222 + #2724 + #1 #272*1 



8 x zl - 8 2 z* 7 



#l#2^8 



#172^ + 724 + #l#2 72-23 + #2724 - #1-25 - Z% - 8182$ ~ #2-28 



#272^ + #1#272^2 + 724 + #172*4 - #2^5 - #24 



#1*8 



#1#272^ + #2724 + #172-23 + 72^1 - #l#2-25 ~ #2-26 ~ #1-2? 



The matrix representation of L x is thus given by: 



[L, 



Z\ 


Z2 


Z3 


Z4 




~ z 6 


-z 7 


~ z 8 


Z2 


z\ 


Zi 


Z3 


— 7* 


— 7* 


— 7* 
z 8 


— 7* 
z 7 


Z3 


ZA 


Z\ 


Z2 


-4 


~ z 8 


~ z 5 


* 


z 4 


Z3 


22 


Z\ 


-4 


— 7* 
z 7 


— 7* 
z 6 


— 7* 
z 5 


Z5 


Z6 


Z7 


Z8 


7* 


7* 
x -2 


* 

z 3 


* 

z A 


ze 


Z5 


Z8 


Z7 


2 2 


Z l 


* 

z 4 


* 

Z 3 


Z7 


Z8 


Z5 


Z6 


7* 
z ?> 


7* 
z 'l 


7* 


■7* 
z 2 


Z8 


Z7 


Z6 


Z5 


* 

2 d 


Z3 


* 

z 2 


* 

4 



Also, we have that 

X = ZU + 71Z1Q + #1.Z 2 I + #171 Z 2 Q + #2^37 + #271 Z 3Q + #1#2^47 + #l#27l*4Q + 72^57 



(27) 



(28) 



(29) 



(30) 



+7271 Z 5Q + #172^67 + #17271 Z 6Q + #272^77 + #27271 Z 7Q + #1#272^87 + #l#2727l*8Q- 

Since the map ^ is a ring homomorphism, we have 

L x = (f)(1) Z U (f)(1) + <^(7l)ZlQ + (f)(5i)z 2 I + 0(#l7l)^2Q + 0(#2)2;37 + <?K#27l)*3Q 

+^(#1# 2 )2 4 / + 0(#l#27l)2 ; 4Q + ^(72)^57 + ^(727l)^5Q + ^(#172)^67 + <K#l727l)*6Q 
+^(#272)^77 + ^(#2727l)^7Q 
+(f)(5i #272)^87 + ^(#l#2727l)^8Q- 

The equation (l30l explicitly gives the design [L x ] in terms of its weight matrices. Because of our 
algebraic construction, the weight matrices can be partitioned into four groups such that ® is satisfied. 
The four groups are 

1) {0(l),<A(#l),<A(#2),0(#l# 2 )} 
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2 ) {0(7i)> 0(<*i7i)j H s 2li), 4>(5iS 2 'yi)} 

3) {0(72), 0(^172), 0(^272), 0(o"i5 2 7 2 )} 

4) {0(727i)> 0(°i727i)> 0(^2727i)> 0(<5i £27271)} 

respectively. Expressing the real variables of the resulting design and their corresponding weight matrices 
in the form of a tabular column as shown in Table HI we get 



0(1) 


0(71) 


0(72) 


0(727l) 


Zll 


ZlQ 


Z5I 


25Q 






0(01)0(72) 


0(^l)0(727l) 


Z21 


Z2Q 


^6/ 


z&Q 


<i>(5 2 ) 


0(5 2 )0(7i) 


0(^)0(72) 


0(^2)0(727i) 


Z3I 


Z3Q 


27/ 






0(5l52)0(7l) 


0(5l52)0(72) 


0(^l^)0(727l) 






^8/ 


Z8Q 



In general for R = 2 m relays we can take the left regular representation of A2" 1 1 to obtain a 4-group 
decodable LSTD. These LSTDs were first obtained using a non-algebraic iterative construction procedure 
in [51]. The algebraic framework presented in this paper provides an interesting algebraic description for 
codes in [51]. 

Remark 9: Note that in general to represent L x as a matrix one could have chosen any basis for 
instead of the natural basis B^- However, only the natural basis will lead to a design with the low 
decoding complexity requirements, although a different basis will also give a representation of the same 
algebra. This shows that although two designs can be algebraically isomorphic, the choice of basis is 
crucial and only certain basis admit low decoding complexity. Further, even changing the ordering of 
the natural basis can result in designs which apparently look very different. But this is same as simply 
applying a permutation to the rows and columns. 

3) DSTBCsfrom Af: 

We use a slightly different approach to obtain codes from A3 . Let us first consider the algebra, A3 which 
is nothing but Cliffy. A general element of Cliffs looks like 

x = 01 + 7102 + 7203 + 73^4 + 717205 + 727306 + 717307 + 71727308 ( 31 ) 

for some dj £ K,i = 1,...,8. Note that we have used the natural K basis of Cliffs to represent an 
element of Cliffs- The element 717273 satisfies the following properties. 
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(717273) 2 = 1 

(71 ) (717273) = (717273) (71 ) 
(72X717273) = (717273X72) 
(73X717273) = (717273X73) 

Thus the element 717273 squares to 1 and commutes with all the generators of Cliffs. Hence the matrix 
representation of the element 717273 can be used as a candidate to fill up the first column. Since we 
have now filled up two matrices (including identity matrix) in the first column, it should be possible to 
get a 2-real symbol decodable STBC using matrix representation of Cliffs. From Subsection III-BI we 
know that the remaining weight matrices are simply the product of matrices in the first row and those 
in the first column. We have, 



(71) (717273) = -7273, (72X717273) = 7i73 and (73X717273) = -71 72- (33) 

It so turns out that the elements of {1, 71, 72,73, —7172, -7273, 71 73) 717273} also form a basis for 
Cliffs over R Thus a general element of Cliffs can be expressed as 

x = ai + 71 a 2 + 7203 + 73^4 + (-7172)05 + (-7273)^6 + (7173)07 + 717273^8 ( 34 ) 

for some ai G R, i = 1, . . . , 8. By thinking of the element 71 as the complex number i = y/—\, we can 
view Cliffs as a vector space over C. To be precise, 

x = (ai + 7ia 2 ) +7 2 (a 3 + 7105) + 73(04 - 7ia 7 ) +727 3 (-a 6 + 7ia 8 ) 

= Zi+ 7 2 Z 2 + 73^3 + 7273^4 

where, Z{ G C, % = 1, . . . , 4 and are given by z\ = (ai + 7102), Z2 = («3 + 71^5), £3 = («4 — 71O7) and 
z± = (-a 6 + 71 a 8 ). 

Now using left regular representation as in the case of codes from , we get 

L x (l) = z 1 + 72 z 2 + 73^3 + 7273^4 

L x (j 2 ) = 72 ^ -4~ 72734 + 73^4 

M73) = 134 + 72734 -4- 72^1 (36) 

-M7273) = 7273^1 + 7^73^2 + 737273^3 - Z4 
= 7273^1 - 73^2 + 72^3 - Z4 

Hence we obtain the following LSTD [L x ] 
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Zl 


-4 


— 7* 

z 3 


~Z4 


Z2 




— 7* 

Z A 


Z3 


Z3 


* 

z a 


z l 


-z 2 


Z 4 


— 7* 


7* 

z 2 


Zl 



(37) 



By construction, the weight matrices and the real variables of the LSTD [L x ] can be partitioned into 
four groups for decoding purposes which is illustrated in the following table. 



0(1) 


0(71) 


0(72) 


0(73) 


Z\I 


ZlQ 


Z21 


Z3I 


0(717273) 


0(-7273) 


0(7173) 


0(-7i72) 


Z AQ 


Zai 


Z 3Q 


Z 2Q 



In general, for R = 2 m relays we can take the left regular representation of A3" 1 2 . 
Example 3.7: Suppose we want a design for R = 8 = 2 3 relays. Then, we have m = 3. Consider the 
algebra A3. A typical element x can be expressed as 

X = Zi + 72Z2 + 73^3 + 7273^4 + #1^5 + 5l72 ^6^173^7 + ^17273^8 

where,zi = (zu + ^\z\q), z 2 = (z2i + 1\Z2Q), z 3 = {z 3 i - jiz 3 q), z 4 = (-zai + 7i^4q), z 5 = 
(^5/+7i^5q), zq = (z 6 i+J!Z 6 q), z 7 = (z 7I -jiz 7Q ), z 8 = (-z 8 i+^iz 8 q) and zu,ZiQ eK,i = l,...,8. 
Using left regular representation, we get the following LSTD 



zi -zl 




-ZA 


Z5 




-z 7 


~Z8 


Z2 Z\ 


— 7* 
z A 


Z3 


Z6 


7* 
z 5 


— 7* 

Z 8 


z 7 


Z3 Zl 


z i 


-Z2 


z 7 


z 8 


* 

z 5 


-ze 


z 4 -zl 


z 2 


Zl 


Z8 


7* 
z 7 


* 

z e 


Z5 


Z5 -Zl 


— 7* 

z 7 


-Z8 


Zl 


-4 


— 7* 

z 3 


-ZA 


ZQ Zl 


* 

~ z 8 


z 7 


Z2 


4 


-4 


Z3 


z 7 zl 


4 


-ZQ 


Z3 


4 


4 


-Z2 


z 8 -z* 


4 


Z5 


ZA 


-4 


4 


Zl _ 


of real variables 


are {zu, 


zaq, z 5 i 


Z8Q}, {ZlQ 



(38) 



{z2i, z 3Q , z 6I , z 7Q } and {z 3I , z 2 q,z 7I , z 6Q } respectively. 

4) Features of DSTBCs from extended Clifford algebras: Note that both the LSTDs from A2 and A3 
are conjugate linear. This is by virtue of the properties of left regular representation. While taking the 
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left regular matrix representation, recall that we viewed the algebra as a vector space over C by thinking 
of the element 71 as the analogue of the complex number i = Any column of the design was then 

obtained as the image of a few elements of the natural basis of the algebra under the map L x . All the 
elements of the natural basis of have the property that they either commute with 71 or anti-commute 
with 71 . When we find the image of a basis element say a under the map L x , recall that we moved 
a past a complex number Z{. If a commutes with 71, then it leaves the complex number intact. If a 
anti-commutes with 71 , then it inflicts conjugation while moving past the complex number. This property 
leads to conjugate LSTDs. This fact can be clearly observed in for instance d27l ). Moreover it can be 
easily observed that all the relay matrices of the resulting designs are unitary. This is because the number 
of complex variables in the design is equal to the size of the matrix and by virtue of the left regular 
representation any complex variable appears only once in any column and also they appear in different 
positions in every column. Full diversity is guaranteed for both these classes of LSTDs because they are 
CUWDs and full diversity aspects for general CUWDs have been discussed in Subsection IH-B.il Also 
note that both these constructions meet the optimal rate -ML decoding complexity as stated in Theorem 
[6] for g = 4. The constructed DSTBCs can be easily described and they have a very nice structure which 
is due to the algebraic approach. 

IV. OFDM based Distributed Space Time Coding for Asynchronous Relay Networks 

In this section, we consider symbol asynchronism among the relays and propose an OFDM based 
transmission scheme that can achieve full cooperative diversity in asynchronous relay networks. This 
transmission scheme is a generalization of the Li-Xia transmission scheme in [39]. We briefly review the 
Li-Xia [39] transmission scheme in Subsection IIV-AI and in Subsection IIV-BI we describe the proposed 
transmission scheme and also provide code constructions based on the four group decodable DSTBCs 
constructed in the previous section. Finally, in Subsection IIV-CI it is shown how differential encoding at 
the source node can be combined with the proposed transmission scheme in Subsection IIV-BI to arrive 
at a transmission scheme for non-coherent asynchronous relay networks. 

An asynchronous wireless relay network is depicted in Fig. [5J The overall relative timing error of 
the signals arrived at the destination node from the i-th relay node is denoted by 77 Without loss of 
generality, it is assumed that t\ = 0, Tj+i > Tj, i = 1, . . . , R — 1. The relay nodes are assumed to have 
perfect carrier synchronization. The destination node is assumed to have the knowledge of all the channel 
fading gains /«, gi,i = 1, . . . , R and the relative timing errors Tj, i = 1, . . . , R. All the other assumptions 
are same as that made for the synchronous wireless relay network case. 
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Fig. 5. Asynchronous wireless relay network 



In this OFDM based transmission scheme, the transmission of information from the source node to 
the destination node takes place in two phases. In the first phase, the source broadcasts the information 
to the relay nodes using OFDM. The relay nodes receive the faded and noise corrupted OFDM symbols, 
process them and transmit them to the destination. 

A. Li-Xia transmission scheme [39] 

The source takes 2N complex symbols Z{j, < i < N — 1, j = 1,2 and forms two blocks of data 

r i T 

denoted by aj = zq j z\ j ... zs-ij i3 = 1, 2. The first block ai is modulated by iV-point 
Inverse Discrete Fourier Transform (IDFT) and a.2 is modulated by iV-point Discrete Fourier Transform 
(DFT). Then a cyclic prefix (CP) of length l cp is added to each block, where l cp is not less than the 
maximum of the overall relative timing errors of the signals arrived at the destination node from the relay 
nodes. The resulting two OFDM symbols denoted by ai and &2 consisting of L s = N + l cp complex 
numbers are broadcasted to the two relays using a fraction 7Ti of the total average power P consumed 
by the source and the relay nodes together. 

If the channel fade gains fi,gi, i = 1, . . . , R are assumed to be constant for 4 OFDM symbol intervals, 
the received signals at the i-th relay during the j-th OFDM symbol duration is given by 

r ij = Mj + vij 

where, Vij is the additive white Gaussian noise (AWGN) at the i-th relay node during the j-th OFDM 
symbol duration. The two relay nodes then process and transmit the resulting signals as shown in Table 
JI] using a fraction 7T2 of the total power P. The notation £(.) denotes the time reversal operation, i.e., 

C(r(n)) = r(L s - n). 

The destination removes the CP for the first OFDM symbol and implements the following for the 
second OFDM symbol: 
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TABLE II 

Alamouti code based transmission scheme 



OFDM Symbol 


Ui 


u 2 


1 




1 T2-P r * 
V vriP+l 172 ' 2 


2 




^/^C(r 2 ,i*) 



1) Remove the CP to get a iV-point vector 

2) Shift the last l cp samples of the N -point vector as the first l cp samples. 

DFT is then applied on the resulting two vectors. Since l cp > r 2 , the orthogonality between the 
sub carriers is still maintained. The delay in the time domain then translates to a corresponding phase 



change of e 



in the /c-th sub carrier. Let d T2 denote 



i27TT 2 (JV-l) 



. Then the 



received signals for two consecutive OFDM blocks after CP removal and DFT transformation denoted 



by yi 



i T 



2/o,i 2/1,1 



VN-1,1 



and y2 



2/0,2 2/1,2 



yjv-1,2 



can be expressed as: 



yi = V / ^ff(DFT(IDFT(a 1 ))/ l5l + DFT(-(DFT(a 2 ))*) o cP/fa) 
+ ^^Tt(vi,i<7i " V 2 ,2* o d*92) + Wi 



n 2 P 

TTiP + 1' 

Y2 = ^^^(DFT(C(DFT(a 2 )))/ l5l + DFT(C((IDFT( ai ))*)) o d^f*g 2 ) 

+ y55r( V l,2<?l + V 2) l* O cT 2 52 ) + W 2 

where, o denotes Hadamard product, Wi = (wk y i),i = 1,2 is the AWGN at the destination and vy 
denotes the DFT of vy. Now using the identities 



(DFT(x))* = IDFT(x*) 

(IDFT(x))* = DFT(x*) 

DFT(C(DFT(x))) = x 

we get the Alamouti code form in each sub carrier k,0 < k < N — 1 as shown below: 



(39) 



Vk,i 

Vk,2 



TT^P 2 
TTlP+1 



Zk,l 


— 7* 

z k,2 




Zk,2 


z k,l 





fm 



+ 



7T 2 P 
TTlP+1 



~f$92 



vi,i(fe)gi - v 2 ,2*(fe)e « 52 
Vi, 2 (%i + v 2 ,i*(/c)e 



+ 






W k ,2 
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With the power allocation tti = 1, 7T2 = i and because of the Alamouti code form, diversity order of 
two can be achieved along with symbol-by-symbol ML decoding. 

B. Proposed Transmission Scheme 

In this section, we extend the Li-Xia transmission scheme to a general transmission scheme that can 
achieve full asynchronous cooperative diversity for arbitrary number of relays. This nontrivial extension is 
based on analyzing the sufficient conditions required on the structure of STBCs which admit application 
in the Li-Xia transmission scheme. 

1 ) Transmission by the source node: 
The source takes RN complex symbols Zij, < i < N — 1, j = 1,2, ... ,R, and forms R blocks of 



modulated by Appoint IDFT and the remaining R — M blocks are modulated by A-point DFT Without 
loss of generality, let us assume that the first M blocks are modulated by A-point IDFT. Then a CP of 
length l cp is added to each block, where l cp is not less than the maximum of the overall relative timing 
errors of the signals arrived at the destination node from all the relay nodes. The resulting R OFDM 
symbols denoted by ai , a.2 , . . • , slr eacn consisting of L s = A + l cp complex numbers are broadcasted 
to the R relays using a fraction tt\ of the total average power P. 

2) Processing at the relay nodes: 
If the channel fade gains are assumed to be constant for 2R OFDM symbol intervals, the received signals 
at the i-th relay during the j-th OFDM symbol duration is given by 



where, Vi j is the AWGN at the z-th relay node during the j-th OFDM symbol duration. The relay nodes 
process and transmit the received noisy signals as shown in Table HIT] using a fraction TT2 of total power 
P. Note from Table UTTJ that time reversal is done during the last R — M OFDM symbol durations. We 
would like to emphasize that in general time reversal could be implemented in any R — M of the total 
R OFDM symbol durations. Now, tij G {0,±rij, j = 1, ... ,R} with the constraint that the z-th relay 
should not be allowed to transmit the following: 



data denoted by aj 



z N-l,j 



, j = 1, 2, . . . , R. Of these R blocks, M of them are 




{±ry*, j = 1, . . . , M} U {±C(ry),i = 1, . . . , M} 

U {iry, j = M + 1, . . . , R} U {±C(rij*), j = M + 1, . . . , 



R}. 
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TABLE III 

Proposed transmission scheme 



OFDM Symbol 


Ui 




U M 


Um+1 




Ur 


1 


ti.i 




tM,l 


tM+1,1* 




tR,l* 
















M 


tl,M 




tM,M 


tM+l,M* 




tR,M* 


M + l 


C(tl,M+l) 




C(tM,M+l) 


C(tM+l,M+l*) 




C(tR,M+l*) 
















R 


C(tl,R) 




C(tM,R) 


C(tM,R*) 




C(tR,R*) 



Remark 10: If the i-th relay is permitted to transmit elements belonging to the above set, then after 
CP removal and DFT transformation at the destination node, we would end up with the following vectors 
corresponding to each of the four subsets in the above set respectively: 

±DFT((IDFT(aj))*) = ±DFT(DFT(aj*)), j = 1, . . . , M 
±DFT(C(IDFT( aj ))), j = l,...,M 
±DFT(DFT( aj )), j = M + 1, . . . , R 
±DFT(C((DFT(aj ))*)), j = M + 1, . . . , R 

from any of which it is not possible to recover any of ±aj,±aj*,j = 1,2, ...,R. However, if the 
destination node is allowed to apply DFT to some of the received OFDM symbols and IDFT to the 
remaining OFDM symbols, then possibly the above restrictions can be removed, which is a scope for 
further work. 

3) Decoding at the destination: 
The destination removes the CP for the first M OFDM symbols and implements the following for the 
remaining OFDM symbols: 

1) Remove the CP to get a iV-point vector 

2) Shift the last l cp samples of the N -point vector as the first l cp samples. 

DFT is then applied on the resulting R vectors. Let the received signals for R consecutive OFDM 
blocks after CP removal and DFT transformation be denoted by 



yoj yij ■■■ vn-ij 



,j = l,2,...,R. Let Wi = (wk,i),i = 1, • • • , R represent the AWGN 



at the destination node and let Vi j denote the DFT of vy. Let 
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Zk 



ZkA Zk,2 



z k,R 



,k = 0,1,... ,N- 1. 



Now using (1391 ), we get in each sub carrier k, < k < N — 1: 



where, 



Xl 



VkA Vk,2 



Biz k 



' 7T17T2-P 2 
TTiP+1" 



k n k + n k 



BjviZk BM+iZk* •BRZ k * 



(40) 



(41) 



for some square real matrices Bi,i = 1,...,R having the property that any row of Bi has only one 
nonzero entry. If u k = e jv , then 



figi u T k 2 f 2 g 2 ■■■ u k M f M 9M u k M+1 f* M+l g M +i ••• u T kf* R gR 



(42) 



is the equivalent channel matrix for the A;-th sub carrier. The equivalent noise vector n k is given by: 



n k 



TT 2 P 
TTlP+1 



where, sgn(tij) 



Vi 



A Ei=i sff«(ti,i)vi,i(A;)5i'"fc 
Ei=i • s 5 f «(ti,2)vi )2 (fe)5'^fc 

1 if ty G {r u ,j = l,...,fl} 
-1 if ^€{-^,^ = 1,...,^} and 
[ if ty = 
ztvij if i < M and ti jm = irjj 





Wk,l 




Wk,2 


+ 




U>k,R 



. The /3j's are simply scaling factors to account for the 

ztvij* if i > M and t i m = ±r ; j 
correct noise variance due to some zeros in the transmission. 

ML decoding of X k can be done from (1401 by choosing that codeword which minimizes || (y k — 
X k h k ) |||., where Q is the covariance matrix of n k . Essentially, the proposed transmission scheme 
implements a space time code having a special structure in each sub carrier. 

4) Full diversity four group decodable DSTBCs: 
In this subsection, we analyze the structure of the STBC required for implementing in the proposed 
transmission scheme. Then we observe that the DSTBCs constructed in Section [III] have this structure 
and hence are applicable in this setting as well. Note from (|4~TI) that the conjugate linearity property is 
required. But conjugate linearity alone is not enough for a space time code to qualify for implementation 
in the proposed transmission scheme. Note from Table [III] that time reversal is implemented for certain 
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OFDM symbol durations by all the relay nodes. In other words if one relay node implements time reversal 
during a particular OFDM symbol duration, then all the other relay nodes should necessarily implement 
time reversal during that OFDM symbol duration. Observe that this is a property connected with the row 
structure of a space time code. We now provide a set of sufficient conditions that are required on the row 
structure of conjugate linear space time codes. First let us partition the complex symbols appearing in the 
i-th row into two sets- one set Pj containing those complex symbols which appear without conjugation 
and another set Pf which contains those complex symbols which appear with conjugation in the i-th 
row. Any conjugate linear STBC satisfying the following sufficient conditions can be implemented in the 
proposed OFDM based transmission scheme described in the previous subsection. 

P t n Pf = <j), V i = l,...,R 

|P f | = |Pf|, Wi = l,...,R (43) 

pnPj e {<f>,Pi,Pj}, Vi^j. 

To understand what happens if the above condition is not met, let us see an example of a conjugate 
linear STBC which cannot be employed in the proposed transmission scheme. 
Example 4.1: Consider the conjugate linear STBC given by 





Zk,2 


4,3 


— 7* 

z k,A 


Zk,2 


Zk,3 


~ z k,A 


* 

-4,i 


Zk,3 


z k,A 


4,1 


7* 

z k,2 


Zk,4 


Zk,l 


z k,2 


Z k,3 



for which P 1 = P 3 C = {z kA , z k ^}, Pi = Pi = {^,3,^,4}. P2 = Pi = j>fc,2,z fcj3 }, P 2 C = P 4 = 
{zfc,4> z k,i}- It can t> e checked that there is no assignment of time reversal OFDM symbol durations 
together with an appropriate choice of M and relay node processing such that the above conjugate linear 
STBC form is obtained in every sub carrier at the destination node. This is because the conditions in 
(l43l are not met by this conjugate linear STBC. 

For the case of the Alamouti code, Pi = P| = {z k ,i}, P2 = Pf = {^,2} and hence it satisfies the 
conditions in (143b . 

It is easy to observe that the four group ML decodable DSTBCs constructed in Section HIT] satisfy all 
the required conditions as stated in (|43l and are thus suitable for application in the proposed transmission 
scheme. This is illustrated using the following two examples. 

Example 4.2: Let us consider R = 4 and the DSTBC from extended Clifford algebra A| for this case 
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has the following structure 

z k,l z k,2 ~ z k,3 ~ Z l,4 
Zk,2 Zk,l -4,4 ~ Z l,3 
z k,3 Zk,4 4,1 Z t,2 
_ z k,4 Zk,3 4,2 Z k,l 

for which M = 2, P 1 = P 2 = P§ = P 4 C = {z k ,i, z k>2 } and P 3 = P A = Pf = P 2 C = {z k , 3 , z k ,i}. To arrive 
at the above structure in every sub carrier, encoding and processing at the relays are done as follows: 
ai = IDFT(ai), a 2 = IDFT(a 2 ), a 3 = DFT(a 3 ) and a 4 = DFT(a 4 ). 



TABLE IV 

Transmission scheme for 4 relays 



OFDM 
Symbol 


U-L 


V a 


Uz 


Ui 


1 


r l,l 


r 2 ,2 


-r 3 ,3* 


-r 4 ,4* 


2 


ri,2 


r 2 ,i 


-r 3 ,4* 


-r 4 ,3* 


3 


C(n,a) 


C(ra,*) 


C(rs,i*) 


C(r 4 ,2*) 


4 


C(n, 4 ) 


C(ra,a) 


-C(ra,a*) 


-C(r 4 ,i*) 



As discussed in Section ITTTl this DSTBC is single complex symbol decodable and achieves full diversity 
for appropriately chosen signals sets . 

Example 4.3: Let us take R = 5, for which the DSTBC is obtained by taking a DSTBC from PCIOD 
for 6 relays and dropping one column. It is given by 



2fc,l 


— 7* 

z k,2 











Zk,2 


* 

Z k,l 

















Zk,3 


~ z k,4 











Zk,4 


7* 

z k,3 

















z k 5 














Zk,i 



for which P 1 = P 2 C = {z kjl }, P 2 = Pf = {z k>2 }, P 3 = P! = {z k ,3}, Pi = PI = {z k ,i}, P5 = {z k , 5 }, 
Pq = {^,6} an d P§ = P£ = 4>- At the source, we choose ai = IDFT(ai), a 2 = DFT(a 2 ), a 3 = 
IDFT(a 3 ), a 4 = DFT(a 4 ), a 5 = IDFT(a 5 ) and a 6 = DFT(a 6 ). The 5 relays process the received 
OFDM symbols as shown in Table [V] 

This code is 3 real symbol decodable and achieves full diversity for appropriately chosen signal sets. 
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TABLE V 

Transmission scheme for 5 relays 



OFDM 
Symbol 


E/i 


u 2 


U a 


u t 


u 5 


1 


r i,i 


-r 2 ,2* 











2 


C(ri,a) 


C(r 2 ,i*) 











3 








r3,3 


-r 4 ,4* 





4 








-C(rs,a) 


C(r 4 ,i*) 





5 














re, 5 


6 














-C(r 5l e) 



Example 14.31 illustrates how the proposed transmission scheme can be employed for odd number 
of relays as well. Note that the ML decoding complexity of the proposed codes for asynchronous 
relay networks is significantly less compared to all other distributed space time codes for asynchronous 
cooperative diversity known in the literature. 

Note that the full diversity, 2-group ML decodable DSTBCs in [37] and the full diversity, 1-group ML 
decodable DSTBCs in [35], [34] also satisfy the conditions in (l43l and are applicable in the proposed 
OFDM based transmission scheme. 



C. Transmission Scheme for Noncoherent Asynchronous Relay Networks 

In this subsection, it is shown how differential encoding can be combined with the proposed transmis- 
sion scheme described in Subsection llV-Bl and the distributed differential space time codes for noncoherent 
synchronous relay networks in [54] are proposed for application in this setting. 

For the transmission scheme described in Subsection IIV-BI at the end of one transmission frame, we 



have in the k-th sub carrier = J ™p+\ Xkhk + rik. Note that the channel matrix hk depends on 
fi,gi,Ti,i = 1, . . . ,R. Thus the destination node needs to have the knowledge of all these values in order 
to perform ML decoding. 

Now using differential encoding ideas which were proposed in [55], [56], [57] for non-coherent com- 
munication in synchronous relay networks, we propose to combine them with the proposed asynchronous 
transmission scheme. Supposing the channel remains approximately constant for two transmission frames, 
then differential encoding can be done at the source node in each sub carrier < k < N — las follows: 

ag=[ VR ... 1 T , a k = — ^-Cta^CtG^ 

L J t — L 
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where, a k denotes the vector of complex symbols transmitted by the source during the 2-th transmission 
frame and c € is the codebook used by the source which consists of scaled unitary matrices C^Ct = b\ I 
such that E[b 2 t ] = 1. If CB ; = B.Ci = 1,...,M and CB ; = BiC*, i = M + 1, . . . , R for all Ce?, 
then we can write 



9t-l 



(44) 



from which C t can be decoded as C t = axgminc t6 .g> || y k — |^7Cty k 1 ||^ in each sub carrier < 
k < N- 1. 

Note that this decoder does not require the knowledge of fi,gi,Ti,i = 1,...R at the destination 
although this transmission strategy assumes the knowledge of the maximum of the r's since it is needed 
to decide the length of CP. It turns out that the four group decodable distributed differential space time 
codes constructed in [54] for synchronous relay networks with power of two number of relays meet all 
the requirements for use in the proposed transmission scheme as well. Let us see an example to illustrate 
this fact. 



Example 4.4: Let R = 4. The codebook at the source is given by 
( 

-Zo Za 



= < 



Z\ Z2 —-63 

Z 2 Z\ -z% 

Z 3 Zi z\ 

z 4 z 3 z% 



1 




1 





-Zo 



> where {z u , z 2 i} , {z 1Q , z 2 q} , {z 3I , z 4I } , {z 3Q , z 4Q } G S and 



for each sub carrier 0</e<iV — las follows 



■. Differential encoding is done at the source node 



a 



y/R 



a k 



Once we get a k , k = 0, . . . , N — 1 from the above equation, the N length vectors Z{,i = 1, . . . , R can 
be obtained. Then IDFT/DFT is applied on these vectors and broadcasted to the relay nodes as shown 
below: a x = IDFT(ai), a 2 = IDFT(a 2 ), a 3 = DFT(a 3 ) and a 4 = DFT(a 4 ). The relay nodes process 
the received OFDM symbols as given below: 
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(Jt DM 

Symbol 


T J 


7" r 

L>2 


T J 


U4 


1 


r l,l 


r2,2 


-r3,3 


-r 4 ,4 


2 


ri,2 


r 2 ,i 


-r 3 ,4* 


-r 4 ,3* 


3 


C(ri,s) 


C(r 2 ,4) 


C(r 3 ,i*) 


CK2*) 


4 


C(ri,4) 


C(r 2>3 ) 


-C(r 3 ,2*) 


-C(r 4 ,i*) 



for which M = 2, B x = I 4 , B 2 



10 
10 
1 
10 







1 
1 



-1 








-1 






and 



-1 
0-1 
10 



. It has been proved in [54] that CB ; = BiC, i = 1, 2 and 



10 

CBi = BiC*, i = 3, 4 for all C 6?. At the destination node, decoding for {zu,Z2i}, {z\q,Z2q}, 
{z3i,Z4j} and {zsq,Z4q} can be done separately in every sub carrier due to the four group decodable 
structure of ^ . 

V. Simulation Results 

In this section, we study the error performance of the DSTBCs proposed in this paper using simulations. 
We consider both the synchronous case and the asynchronous case. 

For the synchronous case, we compare the performance of the newly proposed DSTBCs from extended 
Clifford algebras and PCIODs with the DSTBCs from field extensions 



The PCIOD taken for simulations is given by 



Z\ 










Z2 


z l 
















-4 








z 4 


4 



34], [35] for a 4 relay network, 
where, {zu,z 3I }, {z 1q ,z 3 q}, 



{Z21, Z41}, {z2Q, Z4q} are allowed to take values from QAM constellation rotated by 31.718°. The DSTBC 
from extended Clifford algebras (ECA) is obtained from A| and is given by 



Zl 


Z2 


-4 


-Z4 


Z2 


Zl 


-4 


— 7* 


Z3 


Z4 


z i 


7* 


Z4 


Z3 


* 
z 2 
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Total average power spent by source and relays, P (dB) 



Fig. 6. Performance comparison of DSTBCs from PCIOD, ECA and field extension 



21 


IZ4 


IZ 3 


IZ2 


Z2 


Zl 


IZ4 


iz 3 


Z3 


Z2 


Zl 


iz^ 




Z3 


Z2 


Zl 



where, {zu, Z21}, {ziq, Z2q], {z 3 i, z 3 q], {z^j, z^q} are allowed to take values from QAM constellation 



rotated by 166.71°. The DSTBC from field extensions [34], [35] is given by 



where z%, i = 1, 2, 3, 4 are allowed to take values from regular QAM constellation. 

Figj6] shows the codeword error rate performance of the proposed DSTBCs (4 relays) in comparison 
to those from field extensions [34], [35] for transmission rates of 1 bit per channel use (bpcu) and 2 
bpcu. We observe that the error performance of the proposed codes are very similar to the 1-group 
ML decodable DSTBC from field extensions [34], [35]. Thus the proposed codes enjoy a good error 
performance along with reduced ML decoding complexity. 

For the asynchronous case, we take R = 4, N = 64 and the length of CP as 16. The delay 73 at 
each relay is chosen randomly between to 15 with uniform distribution. Two cases are considered 
for simulation: (i) with channel knowledge at the destination and (ii) without channel knowledge at any 
node. When channel knowledge is available at the destination, the processing at relay nodes is done as 
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Fig. 7. Error performance for an asynchronous 4 relay network with and without channel knowledge 



described in Example 14.21 and 166.71° rotated QPSK is used as the signal set. Coherent detection is 
done at the destination in every sub carrier. For the case of no channel knowledge, differential encoding 
is done at the source as described in Example 14.41 and a modified signal set (as explained in [54] for 
scaled unitary codewords) is employed. Also, differential detection is done at the destination in every 
sub carrier. Simulations are done for transmission rates (neglecting rate loss due to CP) of 1 bpcu and 2 
bpcu in both the cases. 

The error performance curves for both the cases are shown in Fig. [7] It can be observed that the error 
performance for the no channel knowledge case performs approximately 5 dB worse and 8 dB worse 
compared to that with channel knowledge for transmission rates of 1 bpcu and 2 bpcu respectively. This 
is partly due to the differential transmission/reception technique (which increases the effective noise seen 
by destination) and also in part, because of the change in signal set from rotated QAM to some other 
signal set [54] for scaled unitary codeword matrices. The change in signal set for the sake of scaled 
unitary codeword matrices results in a reduction of the coding gain. 
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VI. Discussion 

In this paper, the problem of optimal rate-ML decoding complexity within the framework of multi-group 
decodable STBCs was posed. Then an algebraic framework for studying CUW STBCs was introduced 
using which the optimal rate-ML decoding complexity tradeoff of CUW STBCs was obtained and several 
optimal code constructions were also provided. Then the paper focused on constructing multi-group 
decodable DSTBCs and three new classes of four group decodable DSTBCs were constructed. The 
OFDM based transmission scheme in [39] was extended to a more general transmission scheme for 
arbitrary number of relays that can achieve full cooperative diversity in the presence of timing errors 
at the relay nodes. It was then pointed out that the four group decodable DSTBCs constructed in this 
paper can be applied in the proposed transmission scheme for any number of relay nodes. A drawback 
of the proposed transmission scheme is that it requires a large coherence interval spanning over multiple 
OFDM symbol durations. Moreover there is a rate loss due to the use of CP, however this loss can 
be made negligible by choosing a large enough N. Finally, it was shown how differential encoding 
at the source node can be combined with this OFDM based transmission strategy to arrive at a new 
transmission strategy than can achieve full cooperative diversity in asynchronous relay networks with 
no channel knowledge as well as no timing error knowledge. The distributed differential STBCs in [54] 
were then proposed for application in this setting for power of two number of relays. 

Some of the interesting directions for further work are as listed below: 

1) The CUW STBCs are based on sufficient conditions for g-group ML decodability. An algebraic 
framework for g-group ML decodable STBCs based on the necessary and sufficient conditions and 
the optimal rate-ML decoding complexity tradeoff of general g-group ML decodable STBCs is an 
important open problem. 

2) How to construct g-group decodable DSTBCs for g ^ 4? In particular, constructing single symbol 
decodable DSTBCs for the synchronous as well as asynchronous cases is worth investigating. Some 
initial results in this direction have been reported in [60], [61]. 

3) In the results pertaining to asynchronous relay networks, we have assumed that there are no 
frequency offsets at the relay nodes. Extending this work to asynchronous relay networks with 
frequency offsets is an interesting direction for further work. This problem has been addressed in 
[62] for the case of two relay nodes. 

4) In this work, we have constructed DSTBCs with low ML decoding complexity mainly for the two 
phase amplify and forward based transmission protocols [33], [36], [49]. Constructing low ML 
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decoding complexity codes for the other cooperative diversity protocols in the literature is also an 
interesting problem. 
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